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Geometric Aspects of Gauge and Spacetime Symmetries 

Steffen Christian Martin Gielen 

Summary 

We investigate several problems in relativity and particle physics where symmetries 
play a central role; in all cases geometric properties of Lie groups and their quotients 
are related to physical effects. 

The first part is concerned with symmetries in gravity. We apply the theory of Lie 
group deformations to isometry groups of exact solutions in general relativity, relating 
the algebraic properties of these groups to physical properties of the spacetimes. We 
then make group deformation local, generalising deformed special relativity (DSR) 
by describing gravity as a gauge theory of the de Sitter group. We find that in our 
construction Minkowski space has a connection with torsion; physical effects of torsion 
seem to rule out the proposed framework as a viable theory. A third chapter discusses 
a formulation of gravity as a topological BP theory with added linear constraints 
that reduce the symmetries of the topological theory to those of general relativity. 
We discretise our constructions and compare to a similar construction by Plebanski 
which uses quadratic constraints. 

In the second part we study CP violation in the electroweak sector of the standard 
model and certain extensions of it. We quantify fine-tuning in the observed magnitude 
of CP violation by determining a natural measure on the space of CKM matrices, 
a double quotient of S'f/(3), introducing different possible choices and comparing 
their predictions for CP violation. While one generically faces a fine-tuning problem, 
in the standard model the problem is removed by a measure that incorporates the 
observed quark masses, which suggests a close relation between a mass hierarchy and 
suppression of CP violation. Going beyond the standard model by adding a left-right 
symmetry spoils the result, leaving us to conclude that such additional symmetries 
appear less natural. 
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Chapter 1 
Introduction 

"Ich kann es nun einmal nicht lassen, in diesem Drama von 
Mathematik und Physik - die sich im Dunkeln befruchten, 
aber von Angesicht zu Angesicht so gerne einander 
verkennen und verleugnen - die Rolle des (wie ich genugsam 
erfuhr, oft unerwiinschten ) Boten zu spielen. " 

Hermann Weyl 

It is impossible to overemphasise the importance of symmetry as a fundamental 
concept in both mathematics and theoretical physics. 

In mathematics, the study of groups arose as an abstraction of "transformation 
groups", invertible maps of a given "point field" into itself |164] . Groups are the 
mathematical realisation of the concept of symmetry. Felix Klein's Erlangen Pro- 
gramme [nS] aimed to describe any geometry by its transformation group; as we will 
detail shortly, the geometries considered in the Erlangen Programme are given by 
coset spaces of the form G/H = {gH,g G G}, where G is a continuous (Lie) group 
and H a closed subgroup of G. Klein himself found the concept of groups most char- 
acteristic of 19**^ century mathematics |164] . The study of continuous symmetries is, 
in mathematical terms, the study of Lie groups and homogeneous spaces which nicely 
combines aspects of group theory and differential geometry. Beyond the direct study 
of groups, symmetry is an important concept in other parts of mathematics, such as 
the study of differential equations. 

In physics, the underlying symmetries of any theory are arguably its most fun- 
damental property. As an example, the first physical theory in the modern sense, 
Newtonian mechanics, fundamentally rests on assumptions about the symmetries of 
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the physical world: A configuration of bodies in (gravitational) interaction will behave 
in the same way if it is rotated as a whole (space is isotropic); a common motion of 
all bodies at constant velocity is not observable (relativity of inertial frames). Con- 
sequently, the laws of Newtonian mechanics are invariant under the action of the 
semidirect product of the rotation group 5*0(3) and commutative Galileian boosts, 
the Euclidean group -E'(3). When Einstein understood in 1905, in the wake of the 
Michelson-Morley experiment [118j . that it was necessary to modify the concept of 
relativity of inertial frames, he did so by replacing the Euclidean group by the Lorentz 
group 5*0(3,1) as the fundamental symmetry group. We will see in chapter |2] that 
this process can be mathematically viewed as a Lie group (or algebra) deformation. 

The Galilean and Lorentz groups are examples of spacetime symmetries; their 
primary action on space (time) itself induces an action on all fields according to their 
tensorial structure. In general relativity, these global spacetime symmetries are made 
local, although global symmetries also appear in many exact solutions of physical 
interest, as we shall see in chapter [21 Both global and local spacetime symmetries 
will play a role in the first part of this thesis. 

More generally, the equations defining a given theory (or the action, in the usual 
case that the equations of motion can be derived from an action principle) may also 
be invariant under a group of transformations of the physical variables which is not 
associated to a transformation of spacetime. A simple example in field theory is that 
of a complex scalar field with equations of motion invariant under a global rotation 
(j) — )■ e^'^ip. Promoting such a global symmetry to a local symmetry leads to a gauge 
theory. We will discuss gauge symmetries in both gravity and particle physics in this 
thesis. 

Mathematically, the fundamental structures describing local symmetries are prin- 
cipal fibre bundles over a given manifold which is interpreted as spacetime, where 
the structure group becomes the "gauge group" of the theory, and a connection on 
such a bundle is interpreted as a gauge potential. The remaining fields that make up 
the physical content of the theory then live in associated vector and tensor bundles. 
While this underlying structure suggests a very close relationship of all fundamental 
interactions in Nature, there are important differences between Yang-Mills theory, 
defined in terms of a principal bundle with compact gauge group, and general rela- 
tivity, where in addition to the connection one has a frame field as a second structure 
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which is crucial for the geometric interpretation of the theoryjj In chapter [3l we will 
study a formulation of general relativity as a Yang-Mills-like theory which tries to 
exploit this analogy; the usual Lorentz algebra valued connection and the frame field 
are combined into a connection on a bundle of the Poincare group (or its deformation, 
the (anti-)de Sitter group). This study will elucidate some of the virtues as well as 
shortcomings of such a formulation. 

Symmetry is our most important guiding principle towards discovering the laws 
of Nature. In particle physics, the electroweak model j72| 11461 I161j unifying the 
electromagnetic and weak interactions was discovered by postulating the fundamen- 
tal symmetries and writing down the most general action invariant under them. A 
crucial step in completing the present picture of particle physics was the proof by 
't Hooft and Veltman [156] that the mathematical structure of Yang-Mills theory is 
compatible with the framework of quantum field theory; namely, Yang-Mills theory 
is renormalisable. By far most attempts to go beyond the standard model of par- 
ticle physics, such as supersymmetric theories and GUTs, are based on postulating 
new symmetries. Part of our analysis of chapter [H] will be to discuss possible observ- 
able consequences of such an existence of additional symmetries beyond those of the 
electroweak model. 

General relativity, however, has so far evaded a similar treatment in terms of 
conventional field theory formulated on a background Minkowski spacetime, and by 
now it seems unlikely that adding (super) symmetries will be sufficient for a consistent 
quantum-mechanical treatment, although there has recently been increasing interest 
in the possiblity that maximal (A^ = 8) supergravity in four dimensions is perturba- 
tively finite [22] • String theory is one attempt to go beyond standard quantum field 
theory by replacing the notion of point particles by extended objects; symmetries 
(such as dualities) play a fundamental theory in string theory as well. Besides the 
symmetries that will be important for a consistent fundamental theory of quantum 
gravity, if we manage to construct (and understand) one, one may wonder whether 
at low energy some effects of quantum gravity will be manifest in a deformation of 
spacetime symmetries. This idea, which has been proposed on a purely phenomeno- 
logical basis [1] as well as a consequence of developments in non-perturbative quantum 
gravity [B] , will be taken up in chapter [3J 

*A second difference is of course tlie non-compactness of tlie gauge group for Lorentzian signature, 
but this seems a less important difference to Yang-Mills theory in the basic structure of the theory 
- one often considers Riemannian signature as well. 
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The notion of symmetries we have discussed so far refers to invariance of the 
action or equations of motion under certain group actions. From a canonical view- 
point, there is another important class of symmetries, namely those generated by 
constraints. To obtain the canonical formulation of a given theory, one performs a 
Legendre transformation, leading to a Hamiltonian 

Mx) = , -H = y d'^-'xnix) = j d'^-'x {Mx)q\x) - C{x)) , (1.0.1) 

where S = J dt J d'^~^x C{x) is the action defining the theory, and a symplectic struc- 
ture on the phase space parametrised by {q\pi), 

{q'ix),p,iy)} = 5]5'-\x-y) , {q\x),q^{y)} = = 0. (1.0.2) 

The Hamiltonian density 'H(x) typically splits into a "dynamical" part which is a 
generic function of the Pi and g*, and a "constraint" part of the form fa{Pi-,<t)'i 
here A" are Lagrange multipliers, variables that appear in the original action whose 
canonical momentum is identically zero and that are conventionally not treated as be- 
longing to the phase space. The functions fa{Pi, q^) then act as constraints imposed 
on the dynamical variables. A subset of these will be first clast\ i.e. their Pois- 
son brackets vanish up to a linear combination of constraints. The Poisson brackets 
{fa,-} of the first class constraints then determine vector fields on the phase space 
whose integral curves are interpreted as describing physically equivalent configura- 
tions. Although not necessarily associated with connections on fibre bundles, the 
transformations on the phase space generated by these vector fields are, by their 
physical interpretation, usually interpreted as "gauge transformations" as well. We 
will see an instance of such a symmetry in the context of a topological field theory, 
which is then constrained by a term that is added to the action, in chapter |H 

Although our discussions of chapter H] will remain classical, the motivation for 
the reformulation of general relativity that we will consider comes from the fact 
that the quantisation of topological field theories is relatively well understood. One 
might therefore hope that starting from a topological quantum theory which is then 
constrained may provide a different route to quantum general relativity. This also 
provides a bridge to the discussion of chapter [3]in that both consider possible reformu- 
lations of general relativity at the classical level which are suggested by developments 
in non-perturbative approaches to quantising gravity. 



^We are here and in the foUowing using the terminology introduced by Dirac [46j . 
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In all of our discussions, an important distinction is that between discrete and 
continuous symmetries. The first systematic study of groups in physics was probably 
that of discrete groups describing the symmetries of crystals |164] . We shall be mainly 
interested in continuous symmetries which are described by Lie groups. The spacetime 
and gauge symmetries described above are of this type. Their different iable structure 
means that differential geometry can be used to study them. In particular, we will 
see that the group structure is encoded in a bracket operation on the tangent space 
to the group at the identity element. 

We will also encounter discrete symmetries, such as charge conjugation and parity 
in particle physics which are represented by the finite group Z2. These will not be 
studied mathematically, but their physical significance will be discussed. 

In the spirit of Hermann Weyl's self-image as a "messenger" between mathemat- 
ics and physics, we will highlight different examples where a study of the differential 
geometry of (spacetime or gauge) symmetry groups leads to important physical con- 
sequences. Let us first introduce the most important aspects of the theory of Lie 
groups and homogeneous spaces that will be important throughout this thesis. 



1.1 Lie Groups and Homogeneous Spaces 

The study of symmetries is the study of groups. If we consider continuous symme- 
tries, the corresponding groups have an additional differentiable structure: They are 
manifolds as well as groupqi|. As simple examples one may think of the groups of 
isometrics of M", or H", denoted by 0{n), U{n) and Sp{n) respectively, or more 
generally, of groups of matrices with continuous parameters which are submanifolds 
of R"^", C"^" or H"^". All examples of relevance in this thesis will be of this type. 

Such groups are known as Lie groups, named after the Norwegian mathemati- 
cian Sophus Lie (1842-1899) who initiated their systematic study in the context of 
systems of differential equations. Lie himself seems to have referred to Lie groups as 
"continuous groups" which describes appropriately what they are. The study of Lie 
groups combines aspects of differential geometry with group theory. 

Two natural actions of a Lie group G on itself are given by the left and right 

■I- One also requires the group multiplication map {x,y) x ■ y and the inversion map a; i~> a;^^ 
to be differentiable. 
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multiplication maps 



L:GxG^G, {h,g)^hg, R : G x G ^ G , {h,g)^gh. (1.1.1) 



By definition of a group these maps act simply transitively: Given a,g ^ G there 
is a unique element h such that L{h,g) = a and a unique element h' such that 
R{h',g) = a. The pull-back and push-forward of these maps act on the tangent and 
cotangent bundle of G. 

Starting with a basis of the tangent space at the identity element e, one can use 
the push-forward of left translation to define a basis of vector fields on G, known as 
left-invariant vector fields. These define a global frame field on G and hence every 
Lie group is parallelisable. (Right-invariant vector fields can be defined analogously 
but often left invariance is preferred.) The tangent space at e, or alternatively the 
set of left-invariant vector fields on G, is known as the Lie algebra q. It inherits a 
Lie bracket [•, ■] from the commutator of vector fields. For matrix Lie groups, the 
bracket is given by the commutator of matrices. 

Lie's important results are summarised in the following theorem [S]: 

Theorem 1.1.1 (Lie) 

(1) For any Lie algebra q there is a Lie group G (not necessarily unique) whose Lie 
algebra is g. 

(2) Let G be a Lie group with Lie algebra q. If H is a Lie subgroup of G with Lie 
algebra (], then ^ is a subalgebra of q. Conversely, for each Lie subalgebra f) of q, 
there exists a unique connected Lie subgroup H of G which has \] as its Lie algebra. 
Furthermore, normal subgroups of G correspond to ideals in q. 

(3) Let Gi,G2 be Lie groups with corresponding Lie algebras 01,02- Then if gi and 
02 are isomorphic as Lie algebras, then Gi and G2 are locally isomorphic. If the Lie 
groups Gi,G2 are simply connected, then Gi is isomorphic to G2. 

Loosely speaking, all of the structure of the group G is encoded in its Lie algebra. If 
one is not interested in the topological properties of G and considers only connected 
and simply connected Lie groups, one can study the geometry of G by studying 
which is just a linear vector space together with a bracket structure. 

For matrix Lie groups, it is usually most convenient to fix a basis of generators 
Ma and to expand the Maurer-Cartan form in terms of this basis: 



g-Ug = a''M, 



a 
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determines a basis of left-invariant one-forms a". Similarly, right-invariant forms can 
be obtained by computing dg which is clearly invariant under g v-^ gh. The basis 
dual to {cr"} is then a basis of left-invariant vector fields. By taking the exterior 
derivative of both sides of fll.l.2p and using d{g^^) = —g^^dgg^^ for matrices, one 
can show that the forms a"" satisfy the Maurer-Cartan relations 

da"" = -lCb\cr^ Ad", (1.1.3) 

where Cah are the structure constants of the Lie algebra defined by 

\M,,Mb\=Ca\M,. (1.1.4) 

One defines left-invariant tensors accordingly, e.g. left-invariant metrics as generated 
by symmetrised combinations of 0°". Any metric of the form 

g = Kbo''®o'' (1.1.5) 

for constant symmetric hah is by definition left-invariant. One can phrase this by 
saying that there is a one-to-one correspondence between scalar products on the Lie 
algebra g (specified by hah) and left-invariant metrics on G [9]. These metrics are the 
ones compatible with the left action on G on itself. 

Lie groups are in some sense special cases of a more general class of manifolds 
that will be important in the following: In many situations of physical interest, one 
considers a manifold X which a Lie group G acts on transitively, but not necessarily 
simply transitively. That is, for any x, x' G X, there is a (not necessarily unique) 
g E G with g ■ X = x'. (Again we take this to be a left action, the case of a right 
action is analogous.) Then X is called a homogeneous space. For a given x G X, 
the subgroup Gx = {g ^ G : g ■ x = x} leaving x invariant is called the stabiliser, 
little group or isotropy subgroup of x. Then we have the following result [9] 

Proposition 1.1.2 Let G x X — t- X be a transitive action of a Lie group G on a 
manifold X , and let H = G^ be the isotropy subgroup of a point x. Then: 

(a) The subgroup H is a closed subgroup of G. 

(b) The natural map j : G/H ^ X given by j{gH) = g ■ x is a diffeomorphism. In 
other words, the orbit G ■ x is diffeomorphic to G/H. 

(c) The dimension of G/H is dim G — dim if. 
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Since we assume transitivity of the group action, the orbit G ■ x is the whole of 
X; one can then identify X with the coset space G/H^ The choice of x in this 
identification is arbitrary and corresponds to a choice of origin in X; different choices 
for X lead to groups G^ related by conjugation. The natural left action of G on G/H 
is just induced by the group multiplication law. 

In many applications, an additional assumption is satisfied which simplifies the 
construction of metrics on a homogeneous space: A given homogeneous space G/H 
is called reductive if there is a splitting of Lie algebras of the form 

= [}©y (1.1.6) 

invariant under the adjoint action of H. Then the subspace y of the Lie algebra g is 
isomorphic |9j to the tangent space to G/H at H (or any other coset gH). It therefore 
defines a basis of left-invariant vector fields on X, and one can take the dual basis 
of a basis for y to construct left-invariant metrics on X. Reductive geometries will 
be of importance in chapter [3] (with G = SO{d, 1) and H = SO{d — 1,1)) and in 
chapters [5] and [6] (with G = SU{3) and H = U{lY). We will encounter an example 
of a non-reductive geometry in section 12. 4[ 

Homogeneous spaces are the simplest generalisations of (flat) Euclidean geom- 
etry: In the 19**^ century, attempts to deduce Euclid's fifth postulate, the parallel 
postulate [50], from the other fouiQ, led to the discovery of elliptic and hyperbolic 
geometries based on the sphere S*" ~ SO{n + 1)/S0{n) and hyperbolic space if" ~ 
SO{n,l)/ SO{n), the natural counterparts of Euchdean space ^ E{n) / SO{n). 
In Lorentzian geometry, the spaces of constant curvature, Minkowski, de Sitter and 
anti-de Sitter space, are also homogeneous spaces. Many other known exact solutions 
in general relativity are also homogeneous spaces; see |154[ chapter 12]. 

Homogeneous spaces appear whenever there are (gauge or spacetime) symmetry 
groups. Another example would be a Higgs potential spontaneously breaking a sym- 
metry group G down to a subgroup H; then the space of vacua would be G/H. 



^This result justifies dropping the distinction between the terms "coset space" and "homogeneous 

space" as we will do in the following. 

^About 30 such "proofs" were shown to be unsatisfactory in the 1763 dissertation of A. Kaestner's 
student Georg Simon Kliigel entitled "Conatuum praecipuorum theoriam parallelarum demonstrandi 
recensio" (Critique of the foremost attempts to prove the theory of parallels) [571 p. 274]. 
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1.2 Structure of this Thesis and Conventions 

The thesis consists of two main parts. 

The first part explores symmetries in relativity which are deformed or constrained. 
In chapter[2]we introduce deformation theory of Lie algebras and investigate its appli- 
cation to global symmetries (isometrics) of certain exact solutions to Einstein's equa- 
tions in four dimensions. We will see how the group-theoretic viewpoint of spacetimes 
in terms of their isometry groups relates to physical properties such as causality vio- 
lation. Section 12^3] contains calculations and results taken from |65] and are the result 
of a collaboration with Gary Gibbons, except subsection 12.3.61 which is new. We omit 
those parts of [IS] which are of less relevance to the discussion of symmetry groups. 
In chapter [3] we turn to a study of local symmetries in relativity, namely those of 
the tangent space. We investigate how Cartan geometry, a generalisation of Rieman- 
nian geometry where the flat tangent space is replaced by an arbitrary homogeneous 
space, can be used to generalise the proposed deformed special relativity (DSR) to 
a "deformed general relativity" in the same way that (pseudo-)Riemannian geome- 
try describes general relativity as an extension of special relativity. We analyse the 
physical predictions of a particular example supposedly describing Minkowski space, 
and note that it involves a connection with torsion potentially leading to disastrous 
predictions of charge nonconservation. This analysis serves as another example of the 
interplay between an algebraic viewpoint on (infinitesimal) symmetries and physical 
effects. The material presented in chapter [3] is mainly taken from ^6] which again 
resulted from collaboration with Gary Gibbons. In chapter H] we study yet another 
possible reformulation of classical general relativity as a topological BF theory with 
constraints. Such a formulation has been considered before and is of relevance in 
covariant approaches to quantising gravity, but we give a new version of it where the 
constraints are linear, removing some of the ambiguity in their solutions. We see how 
a very large symmetry on the phase space of the theory - the local equivalence of all 
solutions to the equations of motion - can be constrained to give general relativity, a 
theory with local degrees of freedom. The results detailed in chapter HI pubhshed in 
|69] . were obtained in collaboration with Daniele Oriti. 

The second part is concerned with gauge symmetries in particle physics and their 
influence on CP violation in the electroweak sector. We aim to give estimates for 
the likelihood of the observed magnitude of CP violation, measured by the Jarlskog 
invariant J, by constructing a natural measure on the space of Cabibbo-Kobayashi- 
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Maskawa (CKM) matrices. This space is a double quotient of SU{3) and does not 
have a clearly preferred measure. After introducing different choices for the measure 
in sectional motivated first only by geometric and then also by physical considerations 
- we will incorporate the observed quark masses into the analysis as well - in chapter 
[6] we make a detailed study of their physical predictions. We find that once the quark 
masses are assumed, predictions for CP violation agree well with observation. We 
then provide another example of how modified symmetries affect physical predictions 
by assuming an extended left-right symmetry given by a second SU (2) gauge group. 
This will influence our predictions on CP violation and lead to completely different 
results. The analysis for the standard model was done in collaboration with Gary 
Gibbons, Chris Pope and Neil Turok in |i67j. Much of the calculations presented in 
sections |6 . 3 1 and 1^7^ are taken from this paper; I have omitted some parts of where 
my original contribution has been limited. The calculations in subsection 16.4.21 are 
my own and were published in [68] . 

We close with a brief summary and outline possible directions for future research. 

Our conventions for general relativity are those of Hawking & Ellis; the signature of 
a Lorentzian metric is mainly positive; Greek letters are used for spacetime and Latin 
letters for internal indices. The curvature of a connection is always F = dA + A /\ A 
without any coupling constants as customary in some of the particle physics literature. 
In discussions of CP violation in particle physics we follow the book of Jarlskog [9T] . 
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Chapter 2 

Deformations of Spacetime 
Symmetries 

2.1 Overview 

The archetypal examples of groups describing spacetime symmetries in physics are the 
Galilean and Lorentz groups of classical mechanics and special relativity. The crucial 
insight of Einstein in 1905 was the necessity of replacing the Galilean group by the 
Lorentz group in order to have an invariant speed of light c; as we will see shortly, this 
process is also the archetypal example of a Lie algebra deformation. Historically, 
the development of a mathematical theory of deformation and contraction of Lie 
algebras was motivated by advances in theoretical physics, but it is at least conceivable 
that the replacement of the Galilean group by the Lorentz group had been suggested 
by mathematicians working on Lie group theory, instead of physicists like Einstein. 

Since physical theories are to some extent always idealised approximations of a 
much more complex Nature, the process of deformation of a symmetry group allows 
one to make statements about the stability of a given theory. This stability point 
of view was explored in detail in |116j. The passage from non-relativistic to special- 
relativistic mechanics can be understood as a deformation of an "unstable" to a 
"robust stable" theory |116] . The point here is that the zero commutators of Galilean 
boosts constitute "fine-tuning" and any perturbation of the commutation relations 
leads to noncommuting boosts. On the other hand, given noncommuting boosts, a 
slight perturbation of the coefficient appearing in the commutator can be undone by 
redefining the algebra generators, hence one has "robustness" in the theory. 
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In |116j . the possible central extension of the Poisson algebra of the phase-space 
coordinates of Hamiltonian classical mechanics, leading to quantum mechanics, was 
interpreted as a similar instability of the original algebra. It was then argued that 
although the resulting Heisenberg algebra admits a deformation, it can be undone by 
a nonlinear redefinition of the position coordinate. 

In this chapter, we shall focus on Lie algebra deformations, perturbations of the 
structure constants which can be expanded in a power series. We are maintaining the 
structure of a Lie algebra, i.e. of a vector space, and redefinitions of the generators 
will always be linear transformations. One could consider more general kinds of de- 
formations, where the Lie algebra structure is lost, either by allowing commutators of 
two generators to be nonlinear in the generators or by allowing "structure functions" . 

While the existence or non-existence of deformations of a given Lie algebra is a 
mathematically well-posed question, the physical interpretation of its answer is less 
clear in general. Both the generators and the deformation parameters of a given Lie 
algebra may have different possible interpretations. For example, while the Poincare 
group admits a deformation to the de Sitter or anti de Sitter groups, it was argued in 
|116] that the deformation parameter R, defining a length scale, may be taken to be 
infinity if one is only interested in local kinematics, which of course is what happens 
in general relativity. It is also clear that while linear transformations can formally be 
applied to the generators in the Lie algebra, it may not appear physically meaningful 
to take linear combinations of, for instance, translation and rotation generators in 
the Poincare algebra. This point was stressed in [S7\, who also pointed out that the 
interpretation of position operators appearing in a "kinematical algebra" acting on 
single- or multi-particle quantum states is physically rather unclear, since position is 
not additive in a way that momentum and angular momentum are. 

The classic cases of the de Sitter group and its contractions were studied in detail in 
|12j . as we will review shortly. One could argue that the mathematical theory behind 
deformations seems to have predictive power in terms of extensions or generalisations 
of fundamental theories, but these are only a posteriori predictions [37J. There seem 
to be two obvious classes of new applications for the concept of deformations of 
spacetime symmetries: 

• Situations where the relevant symmetry group is a subgroup of the Poincare 
group, or more generally a group of lower dimension than the Poincare group. 
One example for the former is the proposed "very special relativity" [39], where 
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the fundamental symmetry group is taken to be ISIM(n — 2), consisting of the 
maximal subgroup of the Lorentz group leaving a null direction invariant plus 
translations. The possibility of deforming this group to obtain "general very 
special relativity" was discussed in [52]; a physically acceptable one-parameter 
deformation was found, similar to the case of the Poincare group which can be 
deformed into the (anti-)de Sitter group. Interestingly enough, this deformed 
group turns out to be the symmetry group of an asymptotically AdS space- 
time playing a role in the construction of nonrelativistic hydrodynamics in the 
context of AdS/CFT: The "Schrodinger spacetime" [8T] 



can be viewed as a left-invariant metric on a group manifold which is a subgroup 
of the deformed group DISIM;,(n — 2)[n3l app- B]. 

On a less fundamental level, given a homogeneous space G/H with isometry 
group G, e.g. as a given solution of general relativity, one may always ask what 
deformations its isometry group admits, and try to construct spaces with the 
deformed symmetry. The two examples we give in this chapter describe this 
kind of situation. In principle, this could be used to facilitate the search for 
new solutions of general relativity, although one is restricted to homogeneous 
spaces, and curvature invariants are constant. One may still obtain spaces 
with interesting causal properties, as exemplified by the BTZ black hole |152] 
(obtained from anti-de Sitter space by periodic identifications), and by the 
closed timelike curves that we will see occur in the Petrov and Kaigorodov- 
Ozsvath solutions. 

• Alternatively, one may assume that the symmetry group is larger than the 
Poincare group. In particular, the formalism can be straightforwardly extended 
to supergroups [52] . It was found in [52j that the Killing superalgebra of the M2- 
brane in eleven dimensions admits a deformation, which leads to the conjecture 
that a solution with the deformed Killing superalgebra exists, possibly having 
an interpretation as a perturbation of the M2-brane. Unfortunately, the search 
for such a solution does not seem to have been successful*]. 

Another example is Peter West's conjecture of En as being the relevant sym- 
metry group for M-theory [162| . where one could look for possible deformations. 
*Jose Figueroa-O'Farrill, private communication (May 2009) 



ds^ = (-2 du dv - r^^du^ + dx^) + 




(2.1.1) 
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As outlined in section [TTT| the general setup for our considerations is a manifold X 
which a group of transformations G acts on transitively, which may then be identified 
with G/H where if is a subgroup of G. 

When using deformations to look for interesting manifolds "close to" a given one, 
one considers group deformations of the group G and then identifies the appropriate 
stabiliser H in the deformed group. It simplifies the interpretation of the manifold 
one obtains if this stabiliser can be viewed as a deformation of the stabiliser subgroup 
of the original undeformed group, but in general this need not be the case [62j. 

By definition, the deformation theory reviewed here can only make statements 
about Lie algebras. As explained in section II. H Lie's theorems allow us to use the 
terms "Lie algebra" and "Lie group" interchangeably, meaning by "Lie group" the 
unique connected, simply connected group specified by a given algebra. In particular, 
terms such as "Poincare group" or "de Sitter group" refer to connected components 
of the identity. 

We review the mathematical theory of Lie algebra deformations in section 12. 2[ 
and then give two examples for pairs of solutions of general relativity related by such 
deformations: The Petrov and Kaigorodov-Ozsvath solutions are discussed in section 
12. 3[ We shall see that they are both geodesically complete, and try to relate their 
causal properties. We then turn to the Kaigorodov solution in section 12.41 and find it 
can be deformed into a solution known as a "Lobatchevski plane gravitational wave" . 

2.2 Deformation Theory 

The theory of deformation of Lie algebras was reviewed in [104] , where it was con- 
nected to the perhaps more familiar operation of Lie algebra (Inonu-Wigner [88] ) 
contraction. A Lie algebra q is deformed by redefining the Lie brackets as a power 
series in a parameter t 

ftia,b) = [a,b]+tF,{a,b)+fF2{a,b) + ..., a, 6 G g , (2.2.1) 

where the series is required to converge in some neighbourhood of the origin. The 
Jacobi identity then leads to integrability conditions on the functions Fi at each order 
in t: 

J2 c) + F,{F^{a, 6), c) = , (2.2.2) 

V{a,b,c) tJ.+i^=n 

where the sum is over all cyclic permutations and n = 0,1,2, .. . 
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Choosing n = gives back the Jacobi identity for [-, ■], and at hnear order (n = 1) 
one gets 

J2 Fi(K&],c) + [Fi(a,6),c]=0. (2.2.3) 

V{a,b,c) 

A deformation that only corresponds to a change of basis in the algebra g will be 
regarded as trivial; these are of the form 

f{a,b) = S[S-^a,S-^b], SeGL{n), (2.2.4) 

and expanding 5* in a power series around the identity gives the general form of trivial 
deformations at each order in t. 

It is perhaps more common to express these operations in terms of structure 
constants Cab-, as defined in f ll.l.4p for a given basis. Denoting the structure constants 
of the deformed algebra, with respect to the same basis, by Ca ^j,, the Jacobi identity 
for the deformed algebra can be written as 

C,\A\\=^- (2.2.5) 

One can then contemplate M"^("^i)/2 (where n is the dimension of g) parametrised by 
coordinates Ca '^b antisymmetric in the lower indices, and the submanifold of M"^*-""^^/^ 
defined by the Jacobi identity. A deformation of the given algebra g describes a smooth 
curve in this submanifold which can be expanded in a power series in a parameter t: 

Ca Mt) = Ca\ + tAa\ + t^Ba\ + . . . (2.2.6) 

At linear order in t the Jacobi identity fl2.2.5p holds if 

Cdla^b'^c] + ^d[aCb'^c] = . (2.2.7) 

Then a linear deformation only gives rises to a deformation if the requirement (I2.2.5p 
can be satisfied at each order in t. 

Under a change of basis in g, expressed as a matrix S, the structure constants will 
change according to 

Ca \{t) = S\C/fiS-y^{S-y^ . (2.2.8) 

Expanding S°'b{t) = 5^+tM"'b + - ■ ■, this means that to first order a trivial deformation 
can be written as 

= M\Ca\ - C,\M\ - Ca\M\ . (2.2.9) 
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One can rephrase these conditions in the language of differential forms by choosing 
a basis {A"} of left-invariant 1-forms for the original algebra satisfying the Maurer- 
Cartan relations = -iC^^^A^ A A^ and also defining A"- = ^Ab^cA^ A A'^ to be 
a vector- valued 2-form and C^b = Cc^i,\^ to be a matrix- valued 1-form [02] ■ Here 
Cb"c cire the structure constants of the original algebra and A},"'^. is their infinitesimal 
deformation. Then at linear order, the Jacobi identity is 

DA:= dA + C AA = Q, (2.2.10) 

and a deformation that can be written as A = DS for some S — S'^i)X is regarded as 
trivial. Hence non-trivial infinitesimal deformations is in one-to-one correspondence 
with the cohomology group H'^{q,q) [53J. At higher order one has to consider the 
higher cohomology groups. In general, if a non-trivial deformation has been found 
infinitesimally {i.e. at linear order), it does not necessarily extend to a deformation 
satisfying the full set of conditions fl2.2.2p ; obstructions to "integrabihty" of infinites- 
imal deformations are given by the cohomology group H^{q,q) [53]. In practice, once 
all infinitesimal deformations have been found, one can verify the full Jacobi identity, 
and if it is violated, try to add higher order terms to the deformation. 

Using cohomology theory, one can determine all general deformations of a given 
algebra, as was done for the Galilean algebra, with and without central extension, in 
|53j . We will focus on deformations at linear order in the following. 

Applying this formalism, the authors of [12] classified all possible kinematical 
groups in four dimensions, by which they meant classifying ten-dimensional Lie al- 
gebras consisting of "translations" , "rotations" , and "inertial transformations" . This 
splitting meant singling out three generators ( "rotations" ) and fixing their commuta- 
tors with the other generators to assure isotropy of space. They further assumed that 
parity and time-reversal (defined on the generators) leave the Lie brackets invariant, 
and that the subgroup of inertial transformations is non-compact. The classification 
one then obtains consists of eight types of Lie algebras, all of which may be obtained 
by either contracting the de Sitter algebra so(4, 1) or anti-de Sitter algebra so(3,2), 
or alternatively as deformations of the Abelian algebra, as shown in figure 12. 1[ The 
possible contractions correspond to different physical limits such as vanishing cosmo- 
logical constant, infinite speed of light or zero speed of light, as explained in detail in 
|12j . For instance, the Newton groups describe the symmetries of a non-relativistic 
cosmological model with nonzero cosmological constant; this was discussed in detail 
in [M]. 
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para-Poincare/inh. 5*0 (4) 
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Newton groups 





Static (Abelian) 
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Figure 2.1: Possible contractions and deformations of kinematical groups, according to 
[T2] . Arrows denote possible Inonu-Wigner [88] contractions. 

Looking at figure 12. ![ it could be argued that the introduction of noncommut- 
ing translations, the most direct effect of a curved spacetime, is very natural from 
the group deformation viewpoint as the de Sitter and anti-de Sitter groups arise as 
deformations of the Poincare group. 

For completeness, we mention that in order to obtain the algebra of quantum 
mechanics from a deformation of that of classical mechanics, as discussed in |116] . 
one either deforms the Poisson algebra of functions on phase-space, replacing the 
Poisson bracket by a Moyal bracket 



or alternatively considers the phase-space coordinates as elements of an Abelian Lie 
algebra which is replaced by the Heisenberg algebra. In the first case one has an 
infinite-dimensional algebra, in the second case one performs a central extension to- 
gether with a deformation. The very illuminating discussion in |116] used purely 
mathematical arguments as well as physical intuition to establish the stability of the 
Heisenberg algebra, considered as a test for the "robustness" of quantum mechan- 
ics. We stress again that this is a different notion of deformation, since one is now 
considering non-linear transformations of the generators. 

The maximal algebra one could consider in this discussion is the algebra of the 
Poincare group, together with "spacetime coordinates" and an "identity" (a central 
element) which appear as operators in quantum mechanics. In four dimensions, this 



{/,^}m = {f,g} 



4-3! 



J2 cu'^^'^c^'^^'^c^*^^'^9.,.,.3(/)a,,,,,3(<7) + . . . , (2.2.11) 



h '32 'J3 
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is a 15-dimensional group which is naturally deformed into so (6 — t,t), where the 
original central element ceases to commute with the other generators in general. This 
maximally extended case is most obviously fraught with interpretational difficulties, 
as clearly illustrated in |37j . Formally, the deformation process naturally leads to non- 
commuting spacetime coordinates |116] . This proposal goes back to Snyder [151] . 
whose algebra is however not the same as the one proposed in [116] . The idea of 
"spacetime being non-commutative" is very interesting physically, however since it 
is separate from the deformation theory of spacetime symmetries reviewed in this 
chapter, it will be discussed in more detail in chapter [31 

2.3 The Petrov and Kaigorodov-Ozsvath Solutions 

Among the wide range of known exact solutions to Einstein's field equations (the most 
comprehensive source is |154j ). there are some with interesting symmetry properties. 
In this section we concentrate on the particularly simple case of a simply transitive 
group of motions, such that the stabiliser is trivial and there is a one-to-one correspon- 
dence between spacetime points and elements of the group of motions. Therefore the 
spacetime is not only a manifold but also a Lie group. One example, and in fact the 
only example among vacuum solutions without a cosmological constant, is provided 
by the Petrov solution (12.3. ip . An analogous example with negative cosmological 
constant is provided by the Kaigorodov-Ozsvath solution (12.3. lOp . 

The Petrov solution has a physical interpretation as the exterior solution of an 
infinite rigidly rotating cylinder. To gain an understanding of the physical proper- 
ties of the Kaigorodov-Ozsvath solution, we compute the stress-energy tensor of the 
boundary theory in the context of the AdS / CFT correspondence from an expansion 
near the conformal boundary. We then give a matrix representation of the isometry 
group of the Petrov solution and its algebra and use it to construct left-invariant 
one-forms on the group which could be used to compute left-invariant metrics. 

We apply the theory of Lie algebra deformations to relate the Petrov solution to the 
Kaigorodov-Ozsvath solution, and discuss the physical interpretation of this result. 
It is well known that the Petrov solution contains closed timelike curves (CTCs), so 
an obvious question is whether the Kaigorodov-Ozsvath solution exhibits similarly 
exotic properties. Since in the context of possible causality violation by CTCs a 
central question is whether these can be created by some process in a spacetime which 
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did not exhibit CTCs initially, we give a brief discussion of the possible appearance 
of CTCs by spinning up a rotating cylinder. This elementary example supports 
Hawking's chronology protection conjecture which asserts that the appearance of 
CTCs is forbidden by the laws of physics [77J. 

In a more detailed analysis of the physical properties of the Petrov and Kaigorodov- 
Ozsvath solutions, we show that they are geodesically complete. We discuss the global 
causality properties of the two spaces; while the Petrov solution is totally vicious, due 
to a theorem of Carter, a similar analysis for the Kaigorodov-Ozsvath solution remains 
inconclusive. The computation of the stress-energy tensor in AdS / CFT however sug- 
gests causal pathologies in the Kaigorodov-Ozsvath solution as well. 

2.3.1 Vacuum Solutions With Simply- Transitive Groups of 
Motions 

A = - The Petrov Solution 

The Petrov solution is introduced in |154] in the following theorem: The only vac- 
uum solution of Einstein's equations admitting a simply-transitive four- dimensional 
maximal group of motions is given by 

k'^ds^ = dr^ + e'^'^dz^ + e''(cos V3r(rf02 _ ^^2^^ - 2 sin V3r d^ dt) , (2.3.1) 

where k is an arbitrary constant, which shall be set equal to one, and we have rela- 
belled the coordinates compared to [154J. The solution was first given in |134] and 
also discussed in It describes a hyperbolic plane (the (r, 2;)-plane) with a 

timelike two-plane attached to each point. 

The isometry group is generated by the Killing vector fields 

which satisfy the algebra 

[i?,T] = iT-^$, [i?,$] = l$ + ^T, [R,Z] = -Z. (2.3.3) 

The isometry group contains three-dimensional subgroups of Bianchi types I and VII/j 
acting on timelike hypersurfaces, and the solution (12.3. ip is Petrov type I |154] . The 
first three Killing vectors obviously generate translations while the action of the one- 
parameter subgroup generated by R on spacetime is given by the integral curves of 
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R, solutions of 

^ = ^^(x(A)) . (2.3.4) 
These integral curves have the form 



x^{X) = [ tq + X , zqc'^ , (poe 2 cos A + toe ^ sin -^A , 



A 



^oe"^ sin -^A + toe~2 COS -^A I , (2.3.5) 

where we label the coordinates by = (r, z, 0, t). 

The metric components g^^p and gu become zero at certain value Jl| of r, but as 
the determinant of the metric in (12.3.11) is always —1, it is possible to extend the 
coordinates to infinite ranges and the coordinates (r, 2;, 0, t) define a global chart. 
The manifold is also time-orientable, as the vector field = (0, 0, sin ^r, cos ^r) 
defines a global arrow of time, though this amounts to ^ being future- as well as 
past-directed at some points (as well as spacelike at others). 

Bonnor [25J pointed out that the solution can be viewed as a special case of 
the exterior part of a Lanczos-van Stockum solution |103l 1155] describing an infinite 
cylinder of rigidly rotating dust. Since this allows a physical interpretation of f l2.3.ip . 
let us give the general solution for an infinite rigidly rotating dust cylinder, which in 
Weyl-Papapetrou form is given by |157] 

ds'^ = H{p) {dp^ + dz^) + L{p)dx^ + 2M(p) dx dr - F{p)dT^, (2.3.6) 

where H, L, M, F are functions of the radial variable p containing two parameters 
a and R, interpreted as the angular velocity and radius of the cylinder respectively. 

The high energy case aR > |, which contains closed timelike curves (CTCs), is of 
interest here, and with the choices R = y/e and aR = 1 the exterior solution is given 
by 

H{p) = 1, L{p) = -2y|psin (Vsiog^^ , (2.3.7) 

M(p) = Apsin (I + v^log^) , Fip) = -|=psin - Vsiog^) , 
where p is a radial coordinate in the exterior of the cylinder and so is restricted to 

p>y/e, (2.3.8) 



Hhese values of r can be shifted by a coordinate transformation (j> a(p + (3t, t — > — + and 
hence have no coordinate-independent significance 
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and X is an angular coordinate and periodically identified with period 27r; r and z 
are unconstrained. Applying the coordinate transformations 

~z = ^ez, p = v^e^ x=^^^ (^^J 2 + V?>z + ^J 2 - V?>?j ,T=^{z-t) 

(2.3.9) 

to the line element f l2.3.6p indeed gives back ( I2.3.ip . Hence if we adopt the interpre- 
tation of (12.3. ip as describing the exterior of a spinning cylinder, we restrict the coor- 
dinates to r > and identify a/ 2 + a/3z + ^2 - a/S* with \/ 2 + a/3z + a/2 - Vst + 
27rv^V2e. 

The general Lanczos-van Stockum solution has three linearly independent Killing 
vectors ^ and ^ but a fourth Killing vector is only present in the special case 
aR = 1 because the algebraic invariants of the Riemann tensor are independent of p 
just in this case 



A < - The Kaigorodov-Ozsvath Solution 

A solution of the vacuum Einstein equations with negative cosmological constant 
which has a simply-transitive four- dimensional group of motions was first given by 
Kaigorodov [93] and rediscovered by Ozsvath |125j . It has the line element 

ds"^ = -jdr^ + e-^'Xdz'^ + 2dt dcf)) + e^'rf^^ - 2^26" dz dcf) . (2.3.10) 

This solution was also given in |154] . It is Petrov type III and the metric asymptoti- 
cally (as r — )■ — oo) approaches that of anti-de Sitter space. Obvious Killing vectors 
are 

d d d 

and the metric (I2.3.10p has a further isometry 

r ^ r + A, z^ e^z, (p e"^^0, t e^H (2.3.12) 

which is generated by the fourth Killing vector R = + z^ — 20^ + 4t^. The 
Killing vector fields satisfy the algebra 

[R, Z] = -Z, [R, $] = 2$ , [R, T] = -AT . (2.3.13) 

No analogous solution for a positive cosmological constant exists [125(1154] . Because of 
the similarity to f l2.3.3p one would expect (12.3. 13p to arise as a deformation of (I2.3.3p . 
Physically both algebras describe the isometrics of vacuum solutions of Einstein's 
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equations, one with A < and one with A = 0, and hence one might expect one of 
them to arise as some hmit of the other. 

This spacetime is also time-orientable, as the vector field = (0, "1; 1) 

defines a global arrow of time. 

We can express (12.3. lOp in coordinates corresponding to Poincare coordinates on 
AdS (with A = -3) 



The limit p — in Poincare coordinates corresponds to the timelike boundary X 
of anti-de Sitter space. After setting p = p^ the line element is 

cis^ = ^ + i (^z^ + rf02 _ ^^2 _ 2p3/2 ^^^^^ ^ ^^-^ ^ ]^~p^{dt + , (2.3.15) 

and (12.3. 15p is an expansion of the form 

c^s' = ^ + Ig.udx^dx'' , g,,{x, p) = gf^x) + gf}{x)p + gfK^)p'"^ + ■■■ (2-3.16) 

as given in |8Qj. The coefficient g^^'' = —dz ® {dt + dcj)) — [dt + dcj)) ® dz encodes 
the stress energy tensor of the boundary dual theory in the context of the AdS/CFT 
correspondence [151 [80]. It does not satisfy even the null energy condition on the three- 



dimensional conformal boundary, since gj^Jn'^n'^ = —2 for the null vector ^ = §i + -§^- 
This presumably reflects causal pathologies of the bulk spacetime that we will try to 
understand by more conventional means shortly. 

A general analysis of stationary cylindrically symmetric Einstein spaces was done 
in |1U9] . These authors assume the Lewis form of the metric, where a cross term dz dcj) 
would be absent. We did not find it possible to bring (12.3. lOp to the Lewis form. The 
theorem by Papapetrou |127] that any solution with two commuting Killing vectors 
(one timelike, one spacelike with periodic orbits) can be written in the Lewis form, 
only applies to solutions of the vacuum Einstein equations without cosmological term. 
Hence one cannot make a connection with spaces of the Lewis form as was possible 
for the Petrov solution. 



2.3.2 Left-Invariant Forms 



Since the action of the elements of the group manifold on itself has been given in 
(I2.3.5p . we can write down a matrix representation of this group of motions, with a 
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general element given by 

A 

e'" 

9= 



^0 

The group is generated by 






e~5 cos(^r) 
—e~i sin(^r) 






e~5 sin(^r) 
e~5 cos(^r) 



(2.3.17) 
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(2.3.18) 



(2.3.19) 



so that g = e^^+<t>'^+^T^rR coordinates on the group. The generators 

satisfy the algebra 



[R,T] = -^T + y^<i>, [R,<!>] = -U-^T, [R,Z] 



(2.3.20) 



2 2 

This differs from the Killing algebra fl2.3.3p by the usual overall minus sign coming 
from the fact that right-invariant vector fields generate left actions and vice versa. 
The matrix representation 02.3.171) gives the group multiplication law 

(r, z, (j),t) ■ (r', z', (p',t') = (r + r',z + e'^z', + e~^{t's + (p'c),t + e^''^{t'c - (p' s)) , 

(2.3.21) 



where s = sin ^r, c = cos ^r. The Maurer-Cartan form is 



g-^dg = e^^^^iZ dz + <l> d(j) + T dt)e''^ + Rdr = RX^ + Z X'^ + <l> + T , (2.3.22) 



which gives the desired basis of left-invariant one-forms: 

X^ = dr, = e-" dz , 
A^ 



62 I COS I 



— sm I 



dt 



6 2 I sm I 



+ cos ( ) dt 



(2.3.23) 
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We obtain a left-invariant metric on the group 

= dr^ + e-^'dz^ + (cos{V3r){d(j)^ - df) - 2sm{V3r)d(l)dt^ (2.3.24) 

with r] = diag(l, 1, 1, — 1), which is the same as fl2.3.ip and shows that our chosen 
coordinates agree with the initial Petrov coordinates. We see how to recover a metric 
on a group manifold; note that one could obtain this metric by just starting from the 
algebra ^IX^ . 

2.3.3 Deformations of the Petrov Killing Algebra 

We examine possible infinitesimal deformations of the four- dimensional Lie algebra. 
Using Mathematica for the computations, equations (12.2.71) give the following con- 
ditions on linear deformations: 

= A/^ + V3At\ = A/,-V3A/^ = A/^; 

= 2A/^ + A/^ + V3At% = -V3A/^ + 2At\ + At', = A/^ = Ai^; 

= Ar\ + 2A/^ + V3AJ^ - V3Att = ^A;\ + 2A-,\ - V?>A,\ - V?>A^\ 

= \/3A/^ + At'*'^ - V3A^\ - At'r = VSA/^ - At'' ^ ; 

= V?>Ar\ - VSA/^ - 2A,\ - V?>At\ = Ar\ - ^A,\ + 2At\ + ^A^\ 

= A/^ - VSA/^ + V3At\ - At% = A:^ + ^At\ . (2.3.25) 

These constraints for Aah reduce the number of free parameters from 24 to twelve. 
We list the most general deformation parameters satisfying (I2.3.25P in a table: 

Table 2.1: Infinitesimal deformations of the Petrov Killing algebra. 





c = r 


c = z 


C = 


c = t 


A 


2C 


X\ 


X2 


X3 


A " 


-^A-B 


X4 


X5 


Xq 


A ^ 


-A^^flB 


X-j 




Xg 


A " , 





2B 


c 









2A 


-V3C 


c 


A,\ 








-^B- A 


B - ^A 



Deformations of Spacetime Symmetries 



32 



The parameters xi, X2, . . . , xg, A, B and C can be arbitrary real constants. We need to 
investigate which of these correspond to trivial deformations. The conditions fl2.2.9[) 
mean that trivial deformations can be written as 



(2.3.26) 

The parameters C, Xi, X2 and X3 correspond to trivial deformations and can be set to 
zero by a change of basis. Furthermore, 



A/^ = \m^^ + , A/, = -^M\ + \m\ , 



A.. 



-M'\ , A 



1 t 



(2.3.27) 



etc., so that we can set A = S = 0, 



A 



r (p 



^M% + ^M^, Ar 
2 



(2.3.28) 



so that we can set X4 = X5 = Xg = 0, 



Ar\ = - , Ar\ = - + 



73 
2 



73 
2 



73 
2 



2 



r t 



.\m\-^m\-^m' 

2 2 2 



(2.3.29) 



so that we can set xj = 0, but must treat xg and xg as nontrivial perturbations. After 
a relabelling of these parameters, the modified Lie algebra is now 



[R^ T] = -aT - 6$ , [R, $] = --$- , [R, Z] = Z . 



(2.3.30) 



These relations satisfy the full Jacobi identity and so the linear deformation indeed 
defines a deformation of the Lie algebra. We may modify the matrix representation 
by setting 

/o A 

10 



R 



-| -6 
-a 

\o oy 



(2.3.31) 
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In the case where a 7^ |, one can always find a hnear transformation of the basis 
vectors $ and T such that the algebra takes the more symmetric form 



[R, T] = a'T + 6'$ , [R, $] = a'^ ± b'T , [R, Z] = Z 



(2.3.32) 



with ± depending on the value of b in the original deformation. This means that 
there are three distinct cases: 

First case: Positive sign. Then a matrix representation of R is 



R 



(0 











A 





1 

















a' 


V 











h' 


a' 
















V 



(2.3.33) 



and a general group element looks like 

/l 



e' 












e"''' cosh(6'r) e"'*" sinh(6'r) (j) 
e"'*^ sinh(6V) e'^''' cosh(6'r) t 
1 



(2.3.34) 



The Maurer-Cartan form is 

g-^dg = Ze-^dz + ^ e~"''(cosh(6V) dcj) - sinh(6V) dt) 
+Te"''''(cosh(6V) dt - sinh(6V) #) + Rdr 



(2.3.35) 



and we can read off the left-invariant forms. 

The special case a' = 1, 6' = 3 gives the algebra of the Kaigorodov-Ozsvath 
solution ( 12.3. lOp . as can be seen by setting $' = $ — T and T' = $ + T, which 
amounts to 

[i?, $] = $ + 3T , [i?, T] = 3$ + T ^ [R, $'] = -2$' , [R, T'] = AT' , (2.3.36) 

which is just the sign- reversed version of f l2.3.13p . One can recover the metric (12.3. lOp 
by choosing the symmetric matrix 

\ 

1 -V2 V2 
-V2 -1 -1 
V V2 -1 3 / 



h 



ab 



(2.3.37) 
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and computing the left-invariant metric 



habX^ O A'' = -jdr^ + e-'^'dz^ - 2e~^''{d<i? - dt^) + e^"(# - dtf - 2V2e'' dz{d<p - dt) , 

(2.3.38) 

which after the coordinate transformations (/) — t = (p' and —(j) — t = t' reduces to 
(12.3. lOp . Since the matrix (I2.3.37P has one negative and three positive eigenvalues, 
there exists a (vierbein) basis of left-invariant one-forms a" such that rjabcr"' ® cr^ gives 
the metric (12.3. lUI) . 

Second case: Negative sign. 

/ooooi\ /lO 



10 

R= a' b' 

-b' a' 

\o oy 



The Maurer-Cartan form is 



— „zZ+<t>^+tT rR 



A 



e'^ ^ 

e"''' cos(6V) e'^''' sin(6V) 
-e"''' sin(6'r) e"''' cos(6V) t 



\0 







1/ 
(2.3.39) 



g-^dg = Ze-'^rf^+<l>e-'^'^(cos(6V)#-sin(6'r)dt)+Te-'^'"(cos(6'r)rft+sin(6V)d0)+i?(ir 

(2.3.40) 

and a left-invariant metric will be given by 

ds^ = Vab)^"" ® = dr^ + e-^'dz^ + e-^''''^{cos{2b'r){d(f)^ - df) - 2 sm{2b'r)d(f)dt) . 

(2.3.41) 

The original Petrov algebra is of course the special case a' = — |, b' = 
For the metric (I2.3.4ip the Ricci tensor has non-vanishing components 

R^^. = -l- 2a'2 + 26'2 , R^^ = -(l + 2a')e-2'' , 
R^^ = -(1 + 2a')e"^"'^(a'cos(26'r) + 6'sin(26V)) , 
R^t = (1 + 2a')e~^'''''{a'sm{2b'r) - b' cos{2b'r)) , 
Rtt = (1 + 2a')e-2"''-(a' cos(26'r) + b' sin(26V)) . 



(2.3.42) 



The manifold is an Einstein manifold only if a' 



\ and b' = ±^ (A = 0, Petrov 



solution) or a' = 1 and 6' = (A = —3, anti-de Sitter space). In the general case the 
energy-momentum tensor defined by T^j^ = -^qG^v does not satisfy the weak energy 
condition; without loss of generality assume sin(-\/3r) = and cos(-\/3r) = 1 and 
choose a timelike vector = (0,0,t3,t4) {t^ > tf), then 

G^a^'t'' = (1 + a' + - b'^){tl - tl) - 2(1 + 2a')b'tsU (2.3.43) 



Deformations of Spacetime Symmetries 



35 



can be made arbitrarily negative by letting ^3,^4 — >■ ±00 while keeping t| — 1| small 
and positive, unless a' — and 6'^ > |. In the case a' — and 6'^ > |, the 
Einstein tensor can be written as 

G^,^-Xg^, + 2\u^u,, A = 6''-^>0, ^ (1,0,0,0) , u^u^ ^ 1 . (2.3.44) 

Note that is spacelike. This tensor satisfies the dominant energy condition as 
2Vv^ — f^Va > for any timelike and future-directed t,v. These statements are 
independent of the choice of the arrow of time, i.e. hold for both ^4 < or i4 > 0. 
Third case: a—^. Let us introduce a new parameter c, so that 



[R,T] ^ ~T - ^c'^ , [R,^^- 
for positive b which gives 

fo 

1 

R= 



^0 

and a general group element looks like 

/l 



l^-^T, [R,Z]^Z 



l\ 







1 

"2 

2 



2c2 







0) 






e~5 cosh (cr) 








e7 


— ^e~^ sinh(cr) cosh (cr) t 



\0 

The Maurer-Cartan form is 



— -^e 2 sinh(cr) 

ly 



(2.3.45) 



(2.3.46) 



(2.3.47) 



g ^dg — Rdr + Ze dz + ^ [ e'^ cosh{cr) d(f) + ^^e/^ smh{cr) dt 



V3 



x/3 



-t-T ( 62 cosh(cr) dt + "^^2 sinh(cr) dcf) 



(2.3.48) 



In the limit c — > or 6 — > this becomes 



g-^dg ^ Rdr + Z e-'' dz + ^ d(f) + T { ei dt + ^e^r d(j) | . (2.3.49) 
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The only remaining case, namely negative b, is 

[i?,r] = -^T+-^c^$, [R,^^-U-^T, [R,Z]^-Z, (2.3.50) 

which turns the hyperbolic into trigonometric functions. 

If c 7^ one can rescale the coordinate t (for instance) and recover the same left- 
invariant forms as before, hence this does not give anything new. In the case c — a. 
left-invariant metric is given by 

ds^ = dr^ + e-^'dz^ + e''{^{l- ^r^^ - VSr dcj) dt - dt'^ . (2.3.51) 

The Einstein tensor for this metric can be written as 

= ^g^v + I (diag (-2, e''^ 2e^ O))^^ . (2.3.52) 

This does not satisfy the weak energy condition as Gf^vt^f = —^(3 -|- e*") < for the 
timelike vector = (1,0,0,1). 

Of course, for all three possible cases considered, one could take any constant 
symmetric matrix with one negative and three positive eigenvalues instead of r]ab to 
construct a left-invariant metric; this gives a 10-parameter family (subject to con- 
straints) of metrics whose Killing algebras contain one of the deformations of the 
four-dimensional Petrov KiUing algebra as a subalgebra. One of these parameters 
can be absorbed into an overall scale, and any cross terms drdz, dr d(f) or drdt can 
be absorbed into a redefinition of and t. One is left with a 6-parameter family 
of metrics, by far most of which will not appear to have a physical interpretation. 
We have chosen to just give examples; more efficient methods of constructing general 
metrics with a given isometry group are certainly known. 

2.3.4 Spinning Cylinders 

In the case of an infinite rigidly rotating dust cylinder one could imagine trying to 
speed up this cylinder by shooting in particles with some angular momentum which 
enter the interior region on causal curves and increase the angular velocity a, so as 
to reach and surpass the critical value aR — \ above which CTCs appear. We will 
show that this is not possible. 

The interior part of the general van Stockum solution 

ds^ = H{p) {dp^ + dz^) + L{p)d(t)'^ + 2M{p)d(t) dt - F{p)dt^ , (2.3.53) 
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describing the region p < R is given by |157] 

i7 = exp(-aV), ^ = P^(l-aV), M = ap'^ , F = l. (2.3.54) 

As there are closed timehke curves for p > ^ we require aR < 1. The exterior solution 
is for a < |, from now on setting R = 1 for simplicity which is no loss of generality, 

-a'^ -2a'^ T psinh(3e + 6') psinh(e + 6') ^ psinh(e — '^'^ 



H = e-" p-^" , L = . -r^ , M 



2 sinh 2e cosh e ' sinh 2e ' sinh e 

(2.3.55) 

where 6{p) = vl^^4c?log p and e = ArtanhVl — 4a^. Note that always —FL—A/P = 
— p^ and so the metric has the right signature for all p (this of course is also true as 
a — )■ |). The point-particle Lagrangian is (a dot denotes differentiation with respect 
to an affine parameter A) 

C = g^^x^x" = H{p^ + z^) + + 2M0i - Ft^ , (2.3.56) 

and since the Lagrangian does not depend on and t there are three conserved 
quantities associated with geodesies: 

P = Hz, J = L(j) + Mi, E = Fi-M(j). (2.3.57) 

The Lagrangian for timelike or null geodesies becomes 

£ = ifp2 + + (FJ2 _ 2MEJ - LE^) =-^l^ <0 (2.3.58) 
H p^ 

and we obtain the radial equation 

-K7/(P))' (2-3.59) 
where we have introduced an effective potential 



Hp^ ( 


pV^ 




L 











K»^^.±p^J^(,^ + ^ + ^). (2,3,60) 

This is well-defined for all p as if, L, M all remain positive for all p. A particle 
falling in on a geodesic can enter the cylinder if 



(2.3.61) 



1 — \ \ — \ 1 — / 1 — 1 — a 
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Any particle entering the cylinder on a geodesic must have ^ < 1 — a. In the limit 
a — )■ the conserved quantities J and E clearly describe angular momentum and 
energy per mass. We can identify the ratio with the angular velocity of an infalling 
particle at = 1. 

If we are considering accelerated observers, equation (12.3.581) still holds, but P, E 
and J will no longer be conserved quantities. However, only the local values of these 
quantities at i? = 1 will decide about whether or not a particle will be able to enter 
the interior region of the cylinder. 

This means that the above considerations also hold for accelerated observers and 
as any particle entering the cylinder must have ^ < \ for a = |, one cannot speed 
up the cylinder beyond a = | using particles on timelike or null curves. 

2.3.5 Geodesic Completeness 

We ask whether the Petrov spacetime, with the radial coordinate r extended to take 
arbitrary values, is geodesically complete, i.e. whether all timelike and null geodesies 
can be extended to infinite values of the affine parameter. First we give an example 
that this need not be possible on a group manifold: Remove the null hyperplane z = t 
from Minkowski space and consider the half-space z > t, denoted by M~ . It is clearly 
geodesically incomplete. Null translations and boosts 

{z,t) ^ {z + c,t + c); {t + z,t- z) ^ (^X{t + z),j{t~ z)^ (2.3.62) 

act on M~ , and together with translations {x,y) — {x + a,y + b) they form a four- 
dimensional group which acts simply-transitively on M~ . For instance, the point 
{x, y, z, t) = (0, 0, 1, 0) is, by a null translation and a successive boost, taken to 

(0, 0, 1, 0) ^ (0, 0, 1 + c, c) ^ (^0, 0, Ac + ^ - ^, Ac + i + . (2.3.63) 

There is a one-one correspondence between points in M~ and group parameters 
(a,6, A,c), where A > 0. The space M~ can be identified with the group G x R^, 
where G is the unique two-dimensional non-Abelian Lie group. 

As a second example, introduced in a slightly different context in ^76j, consider 
the dilatation group generated by translations and dilatations 



x^ ^ x^ + , x^ ^ px^ , 



(2.3.64) 
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where we denote the generators by Pa and D respectively. Parametrising the group 
elements g = e^'^^^-e^^, the Maurer-Cartan form is 

g-^dg = e-^dx" Pa + d\D (2.3.65) 

and hence a left- invariant metric is 

ds^ = dX^ + e'^\^.dx''dx'' = ^{dp^ + r]^udx''dx'') , (2.3.66) 

where p = e^. This is the metric of anti-de Sitter space in five dimensions in Poincare 
coordinates, which is geodesically incomplete as these coordinates cover only a patch 
of the full spacetime. Hence geodesic completeness is a non-trivial property of a group 
manifold. 

To show geodesic completeness of the Petrov solution, we need to show that no 
geodesic reaches infinity for finite values of the affine parameter. Consider the La- 
grangian 

C = g^.x^x" , (2.3.67) 

which for the metric fl2.3.ip is 

C = P + e-^^'P + (cos VSr (0^ -P)-2^i sin VSr) . (2.3.68) 

Evidently, from the Euler-Lagrange equations, there are three conserved quantities 
because the Lagrangian does not depend on or t explicitly, the conjugate momenta 

P = e'"^^ z , j = {4> cos y/Sr — i sin VSr) , E = {4> sin y/Sr + i cos \/3r) . (2.3.69) 

The Lagrangian now takes the form 

C = r^ + e^^P^ + e"'' (cos V3r{f - E^) + 2Ej sin VSr) , (2.3.70) 

and since the Lagrangian is a conserved quantity in geodesic motion the equation 

(^^^ =e-'XE^cosV3r-2EjsinV3r-fcosV3r)-e^'-p^-p^ (2.3.71) 

is satisfied by any geodesic, where p^ is positive for timelike, zero for null and negative 
for spacelike geodesies. For timelike geodesies {p'^ > 0), right-hand side of fl2.3.7ip 



■l-The coordinate A only covers dilatations with p > 0, so that one may obtain geodesic complete- 
ness by adding the disconnected component with p < 0. This suggests that geodesic completeness 
of groups is a topological question that cannot be decided at the level of the Lie algebra. 
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becomes negative for large r, so that r is bounded and we may extend geodesies 
infinitely. For null geodesies, r is boundecj^. So for any finite values of the affine 
parameter, r remains finite and so do i, (f) and t. Hence the Petrov spacetime is 
geodesically complete. It will be incomplete if we interpret it as the exterior solution 
of a rotating cylinder and cut off the region described by the Petrov solution at some 
value of r. 

By very similar arguments we can show geodesic completeness of the Kaigorodov- 
Ozsvath solution with line element f l2.3.14p . In this case, the Lagrangian is 

£ = 1 fp2 ^ i2 + 02 _ 12\ _ + 0) + 1 ^ ^2.3.72) 



and the conserved quantities are 

P = lz-pii + ^), j = \^-pz+\p\i + <P), E = \i + pz-\p\i + <P) (2.3.73) 
p"^ p"^ 2 p^ 2 

so that the radial equation is 

'^P\ , „-2 77i2\ <^^1t,I- I 77i\ ^10/ ■ I 771N2 ^2, ,2 



^ -p\P'^3' - E') - 2p^P(j + E)- -p^'U + EY - pV^ . (2.3.74) 

The right-hand side becomes negative if /x^ > for both p — )■ oo and p — )■ 0, hence all 
timelike geodesies are bounded in p. For p = the right-hand side is either constant 
zero or becomes negative for large r, so null geodesies are bounded from above in p. 
For very small p, p goes to zero, so as before p is bounded for null geodesies. This 
shows geodesic completeness. Note that in the case of anti-de Sitter space (I2.3.74p 
would be ^ 

'^^^ -p\P^+3^-E^)-p'p^ (2.3.75) 



and depending on the magnitudes of E, j and P the right-hand side blows up as 
p — 7- oo for some geodesies, which can reach p = oo in finite affine parameter distance. 



2.3.6 Causal Properties 

It was observed by Tipler |157] that the Petrov solution contains CTCs. As shown 
in |158] , its causal behaviour is actually even more pathological: The Petrov solution 

^If we allow r to take negative values, then for 7^ or j 7^ the right-hand side becomes 
oscillatory for large negative r, taking positive as well as negative values. Hence both timelike and 
null geodesies are bounded from below in r. For E = j = Q the right-hand side is either constant 
zero or always negative. 
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is totally vicious (for a summary of causality conditions see [59]), i.e. for any two 
points a and b one has a < b and b < a, where "<" is the usual causal relation "can 
be joined by a timelike curve". The proof of this statement proceeds by giving the 
future and past light cones at the identity element of the group manifold, and the 
observation that the adjoint action of elements of the form exp(A-R) rotates the cone 
around, such that for certain values of A a future-directed timelike tangent vector is 
mapped to a past-directed timelike tangent vector. 

One might hope that a similar statement could be made for the Kaigorodov- 
Ozsvath solution, but this turns out not to be CclSG, clS IS apparent from the deformed 
group element fl2.3.34p or the commutation relations (12.3.131) . The action of R on the 
plane generated by $ and T is now just a scaling of the generators, which amounts 
to an opening and closing of the light cones, but no rotation. The assumptions of 
Theorem 2 in [ 158j are not satisfied. 

Apart from the results in [158] which refer to homogeneous spacetimes, we may try 
to apply a criterion for the causality properties of spacetimes with an Abelian isometry 
group acting transitively on timelike surfaces which was given by Carter in his 1967 
PhD thesis [31]. For the Kaigorodov-Ozsvath solution, this Abelian isometry group 



is generated by ^ and ^ which act on surfaces {r = constant}. Carter's result is 
that a sufficient condition for such a spacetime to be totally vicious is the nonexistence 
of a one-form in the dual of the Abelian Killing algebra which is everywhere timelike 
or null. In the present case, such a one-form is of the form u = A dz+B d(f)+C dt, with 
A, B and C constants. Conversely, if there is such a one-form which is everywhere 
timelike, the spacetime is virtuous, i.e. causally well-behaved. In the present case, 
one finds that the norm of such an u is 



e^r j^^2 ^ 2v^A C e^" + C{2B + Ce^"^)) , (2.3.76) 



where g is the contravariant version of (12.3. lOp . In order to make (12.3. 76p non-positive 
for all values of r, one has to choose A = C = to be left with the null one- 
form d(j). Put differently, the subgroup generated by ^ and ^ acts on null surfaces 
{r = (j) = constant}; were these spacelike it would follow that the Kaigorodov-Ozsvath 
solution is causal (cf. Proposition 9 in [33] )• 

Carter's condition for causality violation was used by Tipler in to show the 

pathological behaviour of the Petrov solution. Here the situation is less clear; we 



^whose wording is slightly unfortunate, since he states the relevant condition on lo as being 
"everywhere timelike" 
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cannot show directly that in our example spacetimes related by Lie algebra defor- 
mation have similar causal properties. The strongest indication we have for causal 
pathologies of the Kaigorodov-Ozsvath spacetime comes from the computation of the 
boundary stress-energy tensor in AdS/CFT in (12.3. 16p . 

2.4 The Kaigorodov Solution and Lobatchevski Plane 
Gravitational Waves 

As a different example of a homogeneous space which appears as an exact solution in 
general relativity, we consider the Kaigorodov solution, which has a well-known inter- 
pretation as the AdS analogue of homogeneous pp- waves on a Minkowski background 
|139] . It has received recent interest in the context of string theory as the near- horizon 
limit of an M2 brane 1129] . By using Lie algebra deformations we again try to 
find a spacetime which has a physical interpretation "close to" the Kaigorodov solu- 
tion. An obvious candidate would be a member of the family of "Lobatchevski plane 
gravitational waves" discussed by Siklos in jl49j as spacetimes admitting a Killing 
spinor. The expectation that the particular solution considered by Siklos which has 
a five- dimensional isometry group arises as a deformation of the Kaigorodov solution 
will be confirmed. Since the isometry group is now five- dimensional and not four- 
dimensional as in the previous section, one has to consider a quotient G/H. The 
larger symmetry will place constraints on the general form of a left-invariant metric. 

2.4.1 The Kaigorodov Solution 

The Kaigorodov solution (of Petrov type N) [93j is another example of a homo- 
geneous space-time which solves the vacuum Einstein equations with a cosmological 
constant A < 0. The form given in |154] is 

12 

ds^ = --^dz^ + lO/c e^^dx'^ + e'^^dy'^ - 10m e^dz dx - 2e^ du dx . (2.4.1) 
The five- dimensional Killing algebra of f l2.4.ip is spanned by 
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The stabiliser of the origin is the one-parameter subgroup generated by W, which we 
call H to be consistent with previous notation. The transformations generated by Z 
are 

z^z + X, y^ye^^, x xe"^ , (2.4.3) 



while W generates 



y ^ y + Xx, u ^ u + ye ^''X + -xe ^''A^ 



(2.4.4) 



The Killing vectors have non-zero commutators 



[W,Y] = -U, [W,X] = -Y, [Z,U] = -5U, 
[Z,Y] = -2Y, [Z,X]=X, [W,Z]=3W; 



(2.4.5) 



Y, X and U span a three-dimensional Abelian subalgebra. The generators of the 
kinematical group will have the sign-reversed algebra 

[W,Y] = U, [W,X] = Y, [Z,U] = 5U, 

[Z, Y]=2Y, [Z, X] = -X, [W, Z] = -3W . (2.4.6) 

We denote the isometry group by /C 0; it has three-dimensional subgroups of Bianchi 
types I, II and Vlh- The spacetime is naturally represented as a coset space 



M = 1C/H. 

Represented as matrices, the group /C is generated by 



(2.4.7) 
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II The JC stands for Kaigorodov or Killing, according to taste 
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Any element of the group can be parametrised by 

g = ^xX+yY+ue^'U^zZ^^W ^ ^2.4.9) 

such that (x, z, u) are coordinates on the coset space Ai. The Maurer-Cartan form 

a;Mc = g'^dg = CaX" , (2.4.10) 

where Ca are the group generators, gives a basis of left-invariant one-forms on /C: 

= dx , = e~^^ dy — w dx , = dz , 
X"^ = du — w e'"^"" dy + ^w"^ e"" dx + 5u dz , X^ = dw + 3w dz . (2.4.11) 

As usual, one can now construct a left-invariant metric on the group by 

ds^ = GabX^X\ (2.4.12) 

where Gab is constant and non-degenerate. Any such metric will depend on the 
coordinate w on W, since the Lie algebra splitting 

« = m©f) (2.4.13) 

is not reductive: [W, Z] = 3W where one would require [i),m] C m. (Recall the 
definition of a reductive geometry given in fll.1.61) ). 

The original Kaigorodov metric can be reconstructed by taking 

ds^ = 10A;(A1)2 + (A2)2-^(a3)2_2A1A^ + (A^)2 (2.4.14) 

12 

= lOA; e'^'^dx'^ + e'"^^ dy'^ - —dz^ - Id" dx{du + 5 m rfz) + {dw + 3w dzf 

and reducing to the coset by dropping the last term. 

We summarise the most important physical properties of this solution, as discussed 
in more detail in [139] : The Kaigorodov spacetime has a curvature singularity at 
z = oo, and z = —oo represents null and spacelike infinity; the metric approaches 
the AdS metric asymptotically as z — )■ — oo. The patch covered by the coordinates 
{x,y,z,u) can be extended by changing coordinates to g = e^^, and extending to 
negative q. The enlarged spacetime splits into two disjoint regions g > and g < 
which cannot mutually communicate. Then X = ^ is a timelike Killing vector for 
g < 0, and the enlarged spacetime is stationary in this region. This means that 
gravitational radiation can exist in a stationary spacetime. 
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2.4.2 Deformations of the Lie Algebra 

In order to find infinitesimal deformations of tlie Killing algebra fl2.4.6p . we again 
consider the Jacobi identity at linear order f l2.2.7p . again using Mathematica for 
explicit computations. Imposing these constraints on Aa^i^, one is left with 20 free 
parameters, as shown in table 12.21 



Table 2.2: Infinitesimal deformations of the Kaigorodov Killing algebra. 





C = X 


c = y 


c = z 


c = u 


c = w 


A ^ 

y 


(2e + r7)/6 


(25 + 5r)/3 





-7/4 





A ^ 


-UJ + 2P 


3a + P 


-(5 + r)/3 




7 


A 


(t/3 







5/3(5 + r) 





A ^ 


(Cg - 2C8 - 5C7)/5 


Cl 


(2e + r7)/6 


a 


5 




(6c9 - 7c8 - 15c7)/5 


-cu + P 


(2e + r7)/3 


/3 


T 







-2a/3 





(5e + 477)/3 





^y-w 


m 


Cl 


a/3 




e 


A ^ 

■^z u 


—6m 


C8 


-5a/3 




V 


A 

■^z w 




C4 


(3C9 - C8)/5 




P 


A " 





—m 





C9 


a 



Again it turns out that almost all of the possible deformations should be considered 
trivial {i.e. correspond to a change of basis in the Lie algebra). Only a one-parameter 
deformation, which can be given by cannot be obtained by a basis transformation. 
This infinitesimal deformation satisfies the full Jacobi identity, and we obtain the 
new algebra 

[W,Y]=U, [W,X]=Y, [Z,U] = {^ + a)U, 

[Z, Y] = {2 + a)Y , [Z, X] = {a- l)X , [W, Z] = -3W . (2.4.15) 

One would perhaps have expected that at least one possible deformation would be to 
deform the commutation relations of Z to arbitrary numbers: 

[Z, U] = aU, [Z, Y] = bY, [Z, X] = cX , [W, Z] = dW . (2.4.16) 

The one-parameter deformation we have obtained is the most general deformation of 
this type: By a rescaling of Z one can always keep one of the four numbers a, b, c, d 
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fixed. Then the Jacobi identity puts two constraints on changing the coefficients in 
the three other commutators, leaving a single deformation parameter. 

Put differently, one is free to replace the deformed algebra fl2.4.15p by any one- 
parameter deformation of the action of Z which satisfies the Jacobi identity, e.g. 

\W,Y\ = U , [l^,X] = r, [Z,f/] = 5f/, 

[Z, F] = (2 + a)Y , [Z, X] = (2a - \)X , [IV", Z] = (a - 'i)W . (2.4.17) 

We will use this last form in the following. In the matrix representation we only have 
to modify the matrix for Z: 

/s o\ 

1 

2 + a . (2.4.18) 

2a-l 

\0 Oy 

2.4.3 Construction of the Manifold 

We are trying to construct a metric on the quotient space 

M = IC'/H, 



(2.4.19) 



where the stabiliser H is again generated by W. We use the same parametrisation as 
before: 

g = ^xX+yY+ue^^U^zZ^^wW (2.4.20) 

By computing the Maurer-Cartan form one finds the left-invariant forms are now 



= e(i-2")^ da; , X"^ = 6-^^+"^^' dy - w e'-^~^°'^' dx , = dz , (2.4.21) 
X^ = du-w e-(2+°)^ dy + \u? e^^-^a)^ + 5m rf^ , = rfw + (3 - a)w dz . 



Again, it is not possible to find a left-invariant metric 



ds^ = GabX^X^ 



(2.4.22) 



which is independent of w. We can, however write any such metric in the form 

ds^ = G^^idm + A^dx^f + G^^dx^dx" , (2.4.23) 
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where = {x,y,z,u), and demand that Gfj,u, in order to be a metric on the coset 
space, be independent of w. This reduces the number of free parameters in G from 
15 to nine, and the coset metric is (setting G55, which determines the overall scale, 
to -1) 

ds^ = p3e"(2-4") dx^ + pie-2"(2+a) ^ p^^^2 _ 2e-(i-2") dx{pidu + (5 Pi M - pi)dz) , 

(2.4.24) 

where we have defined 

Pi '■= ^25 ~ G22G55 , P2 = — GsaCss , Ps = 6*15 — GiiGss , p^ = GisGss — Gi^G^^ . 

(2.4.25) 

It is clear that by shifting n by a constant one can always achieve that p4 = 0. This 
means the most general metric on the deformed coset space is 

ds^ = p3e"('-'") dx^ + pie-2"(2+a) ^ p^^^2 _ g^(i-2a) ^^(^^ ^ 5^ _ (2.4.26) 

One discovers that the determinant of fl2.4.26p is 

det g = -e-^^^+^"'^'plp2 , (2.4.27) 

and one finds that for Lorentzian signature ( — 1-++) we must have pi > and p2 > 0. 
One can verify that the metric, as it should be, has five Killing vectors 

oy ox ou 

d d d d d 

Z^ = — + {2 + a)y— + (2a - l)x— , W = ye~''— + x— , (2.4.28) 
oz ay ox ou oy 

which satisfy the algebra 

[W,Y] = -U, [W,X] = -Y, [Z^,U] = -5U, 
[Z^,Y] = -{2 + a)Y, [Z^,X] = {l-2a)X, [W, Z^] = {3 - a)W . (2.4.29) 

One finds that the Ricci tensor of fl2.4.26p is, in the coordinate basis, 

3(2 + aY 5e(^-^")-p3«(a - 3) 

n^u = gtiiy Of, Ou . (2.4.30) 

P2 P2 



There is a cosmological term with 



3(2 + ay 

A = ^ \ 2.4.31 

P2 
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consistent with p2 = —12/ A for the original Kaigorodov solution, and a traceless 
second term which describes null dust. In an orthonormal frame, given by 

dx , 
dx , 





= du + e'^'- 


-2a) ^ 


P3 


25piM^ 

^ 2P2 


2 




= du + e'^^- 


-2a) ^ 


P3 

2pi 


2hpiv? 


Pi 
2 




= ^Jvldz - 








E' 













(2.4.32) 



the nonvanishing components of T"** are 



yoo _ 2^01 _ 2^10 _ 2^11 _ 5p3a(3 a) ,^ ^ , 

P2pi 

The number f l2.4.33p is chosen to be positive so that T"^ describes an electromagnetic 
field; then depending on the value of a the sign of ps has to be chosen appropriately. 

Using deformations of the isometry group of the Kaigorodov solution we have 
constructed a spacetime which belongs to a more general class of "Lobatchevski plane 
gravitational waves" discussed in |149j . To bring the metric f l2.4.26|) into the standard 
form we apply the coordinate transformation 



^^_^^(2+«).^ y = VPr2/, u = -Pie'^u. (2.4.34) 
In these coordinates the metric becomes 

ds^ = \dq^ + df + 2dx du + H(q)dx^] , (2.4.35) 

(2 + a)^q^ '- -' 

with 

Hiq) = P3pt^ ((2 + «)g)^ . (2.4.36) 

In the notation of [154] . we have the metric form, 

3 

rfs2 ^ __ (^^^2 ^ ^~2 ^ 2dx du ± q^'^dx^) (2.4.37) 

with k = 1^ and the sign corresponding to the sign of p^. The condition for positive 
energy given in |154] is equivalent to 

P3(3-a)a>0, (2.4.38) 

which is consistent with the remarks below fl2.4.33p . The case k = —1, corresponding 

to Defrise's pure radiation solution |13] with enhanced symmetry, is the only value 
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of k which does not correspond to any choice of a. Metrics of the form fl2.4.37p are 
conformal to pp- waves and a more detailed physical interpretation is given in |149j . 

Similar to the previous case of the Petrov solution, using a deformation of the 
Killing algebra of a given spacetime we have constructed a solution with similar 
physical properties, namely in this case a pp-wave on an AdS background which now 
describes gravitational radiation as well as a null Maxwell field. 

2.5 Summary 

We have seen how the mathematical theory of Lie algebra deformations can relate dif- 
ferent physical situations, in this case exact solutions of general relativity. The Petrov 
and Kaigorodov-Ozsvath solutions, the unique vacuum solutions without and with 
cosmological constant with simply-transitive four- dimensional isometry groups, were 
shown to be related by Lie algebra deformations; they were shown to be both geodesi- 
cally complete. While the Petrov solution displays pathological causal behaviour as 
first observed by Tipler |157j (for the more general class of Lanczos-van Stockum 
solutions), we were not able to determine the causal properties of the Kaigorodov- 
Ozsvath solution. There is however strong indication that the latter shows causal 
pathologies since we found an unphysical energy-momentum tensor on the boundary 
in the context of AdS/CFT. We have to leave a more concrete study of the relation 
of these two properties to further work. Another interesting question that has been 
left open is whether the Kaigorodov-Ozsvath solution can be understood as a certain 
limit of an exterior solution of a matter distribution, perhaps representing a rotating 
cylinder as wellj**] More generally its physical interpretation which is so far largely 
unknown deserves further study. 

In a second calculation, we analysed deformations of the Killing algebra of the 
Kaigorodov spacetime and saw that one is naturally led to a solution including an 
electromagnetic field. Although we have not constructed new exact solutions here, we 
have shown how this more algebraic approach to exact solutions of general relativity 
can give new insights on physical properties. 



**A class of rotating cylinder solutions with negative cosmological constant has been studied in 
[37], who discuss the appearance of CTCs as well, but leave out certain limiting null cases. Our 
study strongly suggests that the Kaigorodov-Ozsvath solution, if related to the results of :27 , would 
be a limiting case not considered in \27] . 
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Chapter 3 

Deformed General Relativity and 
Torsion 

"Seit die Mathematiker iiber die Relativitdtstheorie 
hergefallen sind, verstehe ich sie selhst nicht mehr. " 

Albert Einstein 

3.1 Introduction 

In the previous chapter, we have discussed exact solutions of general relativity which 
are homogeneous spaces G/H. Since these are special cases with more symmetry 
than one would expect in a generic physical situation, it is natural to investigate a 
more general geometric description in which the symmetry of a given homogeneous 
space is only present infinitesimally. 

The passage from special to general relativity can be understood as making the 
symmetries of a homogeneous space local: General relativity is formulated in terms 
of (pseudo-)Riemannian geometry, which makes the symmetries of Euclidean space, 
which can be thought of as a homogeneous space E{p,q)/SO{p,q) Q, local, and can 
thus be formulated on arbitrary manifolds. Physically, this is the incorporation of 
Einstein's equivalence principle: At at a given point, any spacetime looks to an inertial 
observer like Minkowski space. In the vier-/vielbein formalism of general relativity 
(originally due to Cartan as well as the Cartan connections we will use in this chapter), 
one chooses a basis of the tangent space at each point, and thus a "moving frame" of 

*For p spacelike and q timelike dimensions. We will set p ^ d — 1 and q ~ I in the following. 
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vector fields, such that the metric in this frame is constant and diagonal. The frame 
field then encodes the metric information of the manifold. 

In this chapter, we shall investigate a generalised formulation of general relativity 
in terms of Cartan geometry which incorporates the symmetries of an arbitrary 
given homogeneous space G/H on an infinitesimal level, thus providing an extension 
of both Felix Klein's Erlangen Programme [96] and (pseudo-)Riemannian geometry, 
as summarised in the following picture that we essentially take from |148] : 



Euclidean 
Geometry 



curvature 7^ 



generalise 

transformation group 



Klein 

Geometry 



curvature 7^ 



(pseudo- ) Riemannian 
Geometry 



generalise 
tangent geometry 



Cartan 
Geometry 



Figure 3.1: Cartan geometry extends both Klein geometry and (pseudo-)Riemannian ge- 
ometry. 

In general relativity, if an orthonormal frame has been found, one can still use 
general local Lorentz transformations to obtain a different frame with the same prop- 
erties, and so in this formulation general relativity appears as a gauge theory. The 
description of general relativity is in terms of an [}-valued Ehresmann connection 
(with H = SO{d — 1,1), the Lorentz group), together with a soldering form (the 
frame field) determining how the tangent spaces at different points are "soldered" 
to the spacetime manifold. The gauge group H acts on the tangent space which is 
viewed vector space R ' . 

We will use a similar gauge-theoretic formulation, but of a more general character: 
Instead of viewing Minkowski space as M*^^^'^, we will view it as a homogeneous space 
E{d — 1, 1)/S0{d — 1, 1). A description in terms of Cartan geometry will allow us 
to deform general relativity by replacing E{d — 1, 1) by its deformation, the de Sitter 
group SO{d, 1). We effectively try to construct a theory of gravity built on tangent 
de Sitter spaces which replace the usual (co-)tangent spaces in general relativity. As 

^^Tlic anti-de Sitter group SO{d — 1, 2) could be used equally well, and in fact is used in [153) . 
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we saw in the previous chapter, from the viewpoint of Lie group deformations this 
process is a very natural extension of the standard framework. In addition to the 
speed of hght c, a second scale k with dimensions of momentum is introduced on 
a fundamental level. It is tempting to speculate that this scale could be related 
to quantum gravity, although the basic idea of "deforming" the kinematic algebra 
has existed at least since Heisenberg, who thought in terms of atomic rather than 
quantum-gravitational scales. 

The motivation for investigating this possible generalisation of the usual for- 
mulation of Einstein-Cartan theory, or general relativity in first order formulation, 
comes from the recently proposed framework of deformed, or doubly, special rela- 
tivity (DSR), where one effectively proposes a momentum space that is acted on 
by the de Sitter, not the Poincare group, rather in the spirit of Snyder's proposal 
|151] . It seems natural to try to construct a general geometric framework which in- 
corporates this symmetry on the infinitesimal level. We will use a formulation of 
Einstein-Cartan theory given by Stelle and West, and interpret it in the concrete ex- 
ample of Minkowski space. By doing this construction, we hope to shed light on the 
physical interpretation of DSR, which seems rather unclear at present. In particular, 
while the noncommuting translations appearing in the de Sitter group are commonly 
interpreted as noncommuting "spacetime coordinates" , and one passes to a noncom- 
mutative geometry description to account for this, our discussion will stay within 
conventional differential geometry. We will see that the description of Minkowski 
space in this theory includes a connection with torsion instead of a fundamentally 
noncommutative structure. 

In order to confront this mathematical framework with physical reality, we note 
that such a connection with torsion may lead to the usual ambiguities in minimal 
coupling. We give estimates concerning observable effects and note that observable 
violations of charge conservation induced by torsion should happen on a time scale 
of 10'^ s, which seems to rule out these modifications as a serious theory. Our con- 
siderations show, however, that the noncommutativity of translations in the Snyder 
algebra need not correspond to noncommutative spacetime in the usual sense. 

To set the scene, we start with the Einstein-Hilbert action in four dimensions, 
without cosmological constant, written in first order (Palatini) form: 



where i?^'^ is the curvature two-form of an so(3, l)-valued connection one-form a;°b. 
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One discovers that variation with respect to the vierbein field e" gives the usual 
vacuum field equation RiCa = (where RiCa is the Ricci one- form), while variation 
with respect to a;"?, gives the condition of vanishing torsion. Thus, this formulation 
has the advantage over the second order form that the condition of a torsion-free 
connection does not have to be put in by hand. On the other hand, if one now adds 
a matter action to the Einstein-Hilbert action, both equations now potentially have 
a non-zero right-hand side. While the functional derivative of the matter action with 
respect to the vierbein determines an energy-momentum tensor, one will also have 
nonvanishing torsion if 

^-P^T^O. (3.1.2) 

This is the case for matter with internal spin, such as spinor fields. This generali- 
sation of general relativity which incorporates the possibility of torsion is known as 
Einstein-Cartan theory. Since torsion does not propagate, i.e. the field equation 
contains the full torsion two-form instead of contractions of it (as for curvature in Ein- 
stein's equations for d > 3), it seems difficult to rule out such a theory experimentally, 
since one would only measure torsion inside a matter distribution with spin. 

The basic observation is now the following: Combine the connection u"'b and the 
vierbein e" into a single connection, taking values in the algebra of the Poincare group. 



A 



( \ 

, ,a 1 A 

UJ b jG 



(3.1.3) 



V 0/ 

where L is a length introduced to get dimensions right, but which is completely 
arbitrary at this stage. Then consider the Yang-Mills-like action (with now d = 4) 

^ = I '"^^^ = I r'^^^'^'^'^^^'^Pr , (3.1.4) 

involving the curvature F"'' of the connection A°'b (Latin indices here run from 1 to 4, 
and so one projects F"-^ to its so (3, 1) part in this action). Writing out F"-^ in terms 
of the curvature R""^ of uj°'b and the vierbein field e*, one discovers that the action 
(13.1.41) contains a topological Gauss-Bonnet term plus the Einstein-Hilbert action 
(13.1.11) . Thus, the field equations of Einstein-Cartan theory are recovered. 

The idea that general relativity may be interpreted as a gauge theory with the 
Poincare group as gauge group is of course far from new. In the usual textbook 
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discussions, the significance of a gauge tlieory is expressed as tlie promotion of a 
global symmetry in a given theory to a local one by the introduction of an additional 
field which appears in gauge- covariant derivatives. This viewpoint was probably first 
elaborated in detail by Utiyama p^J, who showed that the promotion of a global 
to a local symmetry directly leads to a gauge field, valued in the Lie algebra of the 
symmetry group under consideration, with the usual transformation properties, and 
thus geometrically to the well known formulation in terms of connections on principal 
bundles (see e.g. |12U] ). Utiyama also considered the Lorentz group as an example 
and was led to general relativity, but only by making several assumptions, such as 
the existence of a Riemannian metric and a local Lorentz (vierbein) frame and the 
symmetricity of the affine connection in its lower indices, without further motivation. 
It was then realised by Kibble [HI] that the more natural gauge group to take is 
the Poincare group, so that the vierbein appears naturally as part of the connection. 
Kibble showed that one is led to general relativity in the first order formalism, i.e. 
to Einstein-Cartan theory, and so to a connection with torsion when matter with 
internal spin, such as fermionic matter, is present. The appearance of torsion when 
fermions are coupled to gravity had earlier been noted by Weyl [165] !^ 

The result of MacDowell and Mansouri |110] is essentially that by considering a 
connection taking values in the algebra so(4, 1) of the de Sitter group, one obtains a 
cosmological term in addition to the Einstein-Hilbert action f l3.1.ip . (MacDowell and 
Mansouri also extended the calculations to supergravity, which we will not discuss 
further.) We will see that the result is not just a mathematical trick, but has a 
geometric interpretation in terms of a Cartan connection. 

The rest of this chapter is organised as follows: We discuss the idea of a curved mo- 



mentum space in section 13721 and give a brief introduction into the ideas of deformed 
(doubly) special relativity (DSR) most relevant to the following discussion in sec- 
tion 13.31 In section 13.41 we outline how Einstein-Cartan theory can be formulated 
as a gauge theory of gravity with the de Sitter group SO{d, 1) as gauge group; this 
theory includes a gauge field that plays a crucial role in what follows. In this section 
we essentially re-derive the results of Stelle and West, using a different set of coordi- 
nates which we find more closely related to the DSR literature. This section uses the 



•l- Rather amusingly, Weyl wondered in |165] whether the "gauge transformations" of electromag- 
netism should not be called "phase transformations" instead - the term "gauge transformation" had 
of course been introduced by Weyl himself |163] in the context of local conformal transformations 
(changes in "gauge"). 
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language of gauge theory familiar to physicists; we then give a more mathematical 
account of the geometry of Cartan connections in section I3.5[ 

We then return to a concrete example: To justify our claim that this geometric 
framework can be used to generalise the ideas of DSR, we show in section 13.61 how, 
if spacetime is taken to be Minkowski space, the simplest non-trivial choice of zero 
section leads to a connection with torsion, providing a geometric interpretation for 
the noncommuting "coordinates" appearing in the Snyder algebra. We close with a 
discussion of our results and their possible physical implications, which show that the 
theory, at least in its given form, is not physically viable. We conclude that there 
may be different physical interpretations of algebraic commutation relations such as 
those used in DSR. 

Since the two most obvious extensions of general relativity are admitting either 
connections with torsion or non-metric connections, we briefly discuss the theory of 
a torsion-free non-metric connection, known as symmetric affine theory, in a final 
section. We will see that it does not fit as well into a description by Cartan geometry 
as the case highlighted in this chapter. 



3.2 Curved Momentum Space 

It is commonly assumed that quantum gravity sets a fundamental length scale, the 
Planck scale |137] . which can not be resolved by any physical experiment. Different 
approaches to quantum gravity, such as string theory or loop quantum gravity, in- 
corporate such a scale. This leads to the idea that some kind of "space discreteness" 
should be apparent even in a low-energy "effective" theory. 

The idea of putting quantum mechanics on a discrete latticql seems to have been 
first considered by Heisenberg in the spring of 1930 |13U] . in an attempt to remove the 
divergence in the electron self-energy. Because the absence of continuous spacetime 
symmetries leads to violations of energy and momentum conservation, this approach 
was not pursued further, but later in the same year Heisenberg considered modifying 
the commutation relations involving position operators instead |13U] . 

A fundamental length scale is absent in special relativity, where two observers 
will in general not agree on lengths or energies they measure. Hence the usual ideas 
of Poincare invariance must be modified in some way. Snyder observed [ISIJ that 



With spacing equal to the Compton wavelength of the proton, ~ 1-3 fm 
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this could be done by deforming the Poincare algebra into the de Sitter algebra, i. e. 
considering the isometry group of a (momentum) space of constant curvature. As we 
have seen in chapter HI if one maintains the structure of a Lie algebra and considers 
deformations of the Poincare algebra, replacing it by the de Sitter (or anti-de Sitter) 
algebra is the unique way of implementing a modified kinematic framework [12j . 

A (i- dimensional de Sitter momentum space with curvature radius k, is defined as 
the submanifold of a (rf + l)-dimensional flat space with metric signature {d, 1) by 

where k has dimensions of mass. Its isometry group is generated by the algebra 

[Mab, Mcrf] = r]acMbd + VbdMac " VbcMad " VadMbc , 

[Xa, Mbc] = VacXb - VabX, , [X„ X,] = ^M,, . (3.2.2) 

Here Mab correspond to a Lorentz subalgebra of the de Sitter algebra, while Xa = 
^Mci+i^a are interpreted as (noncommuting) translations. These translations are then 
interpreted as corresponding to coordinates on spacetime; Snyder thought of operators 
acting on a Hilbert space. Since the operators Xi, . . . ,Xii-i correspond to compact 
rotations in the {d + l)-dimensional space, their spectrum is discrete. In this way, 
one obtains "quantised spacetime", while maintaining Lorentz covariance. 

It should be stressed at this point that we are interested in the full isometry group 
acting on momentum space, i.e. the Poincare group in the case of a Minkowski space. 
Clearly, an equation such as (13.2. ip defines a scale k already in the ((i+l)-dimensional 
Minkowski space; this scale would however not be invariant under translations of this 
space. In other words, there is no natural length scale in Minkowski space, whereas 
there is one in de Sitter space, namely the scale k defining the manifold. There has 
been some confusion in the literature (see e.g. |147] ) about the way in which Lorentz 
invariance is deformed in the Snyder algebra and the more recent ideas of DSR that 
we will review shortly. It should be clear from the commutation relations f l3.2.2p that 
the Lorentz subgroup generated by Mab is not deformed in any way. 

One can give explicit expressions for the algebra elements by choosing coordinates 
on de Sitter space (13.2. ip . The choice made by Snyder is taking Beltrami coordinates 

1 2 d P'^ / X 

P = , P = f^J^i , • • • , P = , (3-2.3) 

whence one has (P"'+i)2 = ^^/(/t^ + rjabP^'p^) to satisfy (13.2.11) . and rjabP^'p^ > —k^, 
corresponding to an apparent maximal mass if p"" were interpreted as Cartesian co- 
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ordinates on a Minkowski momentum space. (Up to this point one could in principle 
have chosen anti-de Sitter instead of de Sitter space. Then this inequality becomes 
labP'^P^ < K^, which perhaps seems less motivated physically.) A necessary sign choice 
means that these coordinates only cover half of de Sitter space. In these coordinates, 
the translation generators 

X =-( P'^^'^ - P = — + -V v'— (3 2 4) 

generate "displacements" in de Sitter space. (In this notation, indices are raised and 
lowered with rjab, the c?- dimensional Minkowski metric, so that pa = rjabV^-) 

The motivation behind these ideas was to cure the infinities of quantum field the- 
ory, which evidently arise from allowing arbitrary high momenta (or short distances). 
In a somewhat similar spirit, Gol'fand suggested [73 j to define quantum field theory 
on a momentum space of constant curvature, using Beltrami coordinates as momen- 
tum variables. This makes the volume of the corresponding Riemannian space finite 
and so presumably leads to convergent loop integrals in the Euclideanised theory. The 
consequences for standard quantum field theory were further explored in [221 El]- 

Gol'fand only assumed that k ^ m for all elementary particles; thinking of quan- 
tum gravity, one would perhaps identify k with the Planck scale, whereas the original 
authors seem to have thought of the Fermi {i.e. weak interaction) scale. 

The induced metric on de Sitter space in terms of the coordinates p° is 

gnr = Vnr 5- , (3.2.5) 

where p ■ p = rjcdP'^p'^- The metric (I3.2.5P becomes singular when p ■ p ^ —i^^, and 
negative definite when extended to what Gol'fand calls the exterior region p-p < —k^. 
In four dimensions, 

det g = -k^°{k'^ + p-p)-^ , (3.2.6) 

and the volume element is d^p k^{k? + p ■ p)~^/'^. 

In Gol'fand's approach (assuming = 4 of course), the standard Feynman rules 
were modified by replacing the addition of momenta p and A; at a vertex by 

{p{+)kY = ^ fpV«:2 + k-k + r f , , )) , (3.2.7) 

- p-k \ \ K + V k2 + k - kj J 

which corresponds to a translation by k of the vector p. (Again p ■ k = r]abP°'k^, etc.) 
It was also noted that spinors now transform under "displacements" as well, which is 
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made more explicit in [92j and [71]. As is well known, five-dimensional Dirac spinors 
still have four components and the matrix 7^ appears in the Dirac Lagrangian, hence 
there is no chirality. This alone seems to imply that the original Gol'fand proposal 
cannot be used for an appropriate model of the known particles. 

Gol'fand's approach is very different from more recent approaches to quantum field 
theory on noncommutative spaces (see e.g. [H]) in that the field theory is defined on 
a momentum space which is curved, but neither position nor momentum space are 
noncommutative in the usual sense. 

3.3 Deformed Special Relativity 

The idea that the classical picture of Minkowski spacetime should be modified at 
small length scales or high energies was re-investigated in more recent times, moti- 
vated by the apparent existence of particles in ultra high energy cosmic rays whose 
energies could not be explained within special relativity [S] . The proposed framework 
of deformed special relativity (DSR) [1] modifies the Poincare algebra, introducing 
an energy scale k into the theory, in addition to the speed of light c. This leads 
to a quantum (/€-)deformation of the Poincare algebra |113] . with the parameter k 
associated with the newly introduced scale. 

It was soon realised [100] that this deformed algebra is the algebra of the isometry 
group of de Sitter space, and that the symmetries of DSR could hence be obtained by 
identifying momentum space with de Sitter space, identifying Xa as the generators of 
translations on this space. The constructions of DSR thus appear to be a resurrection 
of Snyder's and Gol'fand's ideas. We take this observation as the defining property of 
DSR, and will seek to describe a framework in which momentum space, or rather the 
(co-)tangent space in general relativity, is replaced by an "internal" de Sitter space. 
We will see that this can best be done using Cartan geometry. 

When discussing DSR as a modification of special relativity, we take the view 
that special relativity is defined as a kinematic framework with preferred inertial 
systems, related to one another by (proper) Lorentz transformations. That is, one 
has a fiat spacetime on which there exist certain preferred coordinate systems, those 
in which the metric is diagonal with entries ±1. From this point of view, the choice 
of coordinates on the internal de Sitter space plays quite an important role if one is 
looking for a "deformation" of special relativity including an energy scale k. Such 
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a deformation can only arise if the chosen coordinate system reduces to Cartesian 
coordinates on Minkowski space as k — )■ cxd. The choice of coordinates is obviously 
not unique. 

The generators of the algebra will take different explicit forms when different 
coordinate systems (on four- dimensional de Sitter space) are chosen. In |100] "natural 
coordinates" are defined by, in the notation of section 13. 2[ 



g = exp [p\Mm + Xi)] exp O , (3.3. 1) 

where O = (0, 0, 0, 0, k) is taken to be the origin of de Sitter space in five-dimensional 
Minkowski space, and M/5 and M45 correspond to translations in space and time. 
The coordinates one obtains are related to the five-dimensional coordinates by 

P = p e « , P = Ksmh — H e « , P = /tcosh — e « . (3.3.2) 

\ K J 2k \ k J 2k 

Again, these cover only half of de Sitter space where + > 0. The metric in 
these "flat" coordinates is 

ds^ = -{dp^y + e^6ijdp^ dp^ . (3.3.3) 

Shces of constant p^ are fiat; to an observer using these coordinates the spacetime 
appears as expanding exponentially. An illuminating discussion of different coordinate 
systems and kinematics on de Sitter space is given in [29]. 

The Magueijo-Smolin model [112] corresponds to the following choice of coordi- 
nates: 

, P^ . P^ . P^ . P^ , , 

The generators of boosts in de Sitter space take the form 

^'-/#i + P'#7 + V//7, (3.3.5) 

Op^ Op^ K Op'^ 



and translations (not considered by the authors of |112] ) would take the form 

^ p^ + K d ^ 1 ^^^b 9 ^ -p''— + P* + '^ ^ (3 3 g) 

K dp^ K^ dp^ ' K dp^ K dp^ 



^Capital Latin indices such as / and J used in this section only run over spatial coordinates (from 
1 to 3). 
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This choice of coordinates is somewhat pecuhar as takes a special role, as is also 
apparent from the modified dispersion relations presented in |112] . The quantity 



is invariant under boosts and rotations in de Sitter space, as would rjabP^'p^ be in 
Beltrami coordinates. 

Each DSR model corresponds to a choice of coordinates on de Sitter space, such 
that all expressions reproduce the expressions for special-relativistic Minkowski co- 
ordinates as K — 7- oo. What Magueijo and Smolin call a "[/ map" is essentially 
a coordinate transformation from Beltrami coordinates to a different set of coordi- 
nates, which becomes the identity as k — ?■ oo. In the remaining sections we shall use 
Beltrami coordinates. Note that this means we always have p ■ p> —k^. 



The most direct implementation of the ideas discussed so far into a framework describ- 
ing more general spacetimes is replacing the cotangent (or tangent) bundle usually 
taken as phase space by a general symplectic manifold which can be locally 

viewed as a product U x D of a subset U d M. oi spacetime M. with de Sitter space 
T). We want to retain the differentiable structure of a manifold, which we do not 
assume to be present in a full theory of quantum gravity. We also assume that the 
structure of momentum space is fixed and in particular does not depend on matter 
fields, as suggested in |119] . 

If phase space is described as such a manifold, with a choice of origin in the 
"tangent" de Sitter space at each point, the appropriate mathematical language is 
that of fibre bundles. The theory of connections on fibre bundles of this type, called 
homogeneous bundles in [51], was developed by Elie Cartan {e.g. in [32j). Adopting 
this framework means there is now an so(c/, 1) connection, instead of an so (c? — 1, 1) 
connection, defining parallel transport on spacetime. 

It was noted by MacDowell and Mansouri [110] that gravity with a cosmological 
term in four dimensions could be obtained from a theory of such an so((i, 1) connection 
by projecting it onto its so (d — 1, 1) part in the action. A more elaborate description 
in terms of Einstein-Cartan theory was then given by Stelle and West [153j. Their 
analysis included the gauge field needed to identify the fibres at different spacetime 




(3.3.7) 




3.4 A de Sitter Gauge Theory of Gravity 
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points, which will be crucial for the interpretation of the theory. The mathematical 
side of MacDowell-Mansouri gravity as a theory of a Cartan connection is nicely 
illustrated in |166] : we follow this article as well as the more computationally based 
presentation of |153] . who use the language of non-linear realisations. An overview 
over the mathematics of Cartan connections is given in (148] . 

For clarity we first describe the framework in a language more common to physi- 
cists; a more mathematical account of Cartan connections on homogeneous bundles 
is given in the following section 13.51 

The usual description of general relativity as a gauge theory of the Lorentz group, 
known as vier-/vielbein formalism, was reviewed at the beginning of this chapter. 
Since the tangent bundle is in our description replaced by a homogeneous bundle 
with a curved "tangent" space, one has to effectively use a "double vielbein" formal- 
ism, in which spacetime vectors are mapped to vectors in the tangent space to the 
internal (curved) space by a soldering form (vielbein). The picture we have in mind 
is that of a de Sitter space rolled along the manifold. One then needs to introduce 
a new field which specifies the point of tangency, expressed in a given coordinate 
system on the internal space, at each spacetime point. We denote it by p'^(x). This 
corresponds mathematically to a necessary choice of zero section (see below), and 
physically to a gauge field. Picking a point of tangency at each spacetime point 
breaks the gauge group SO{d, 1) down to the Lorentz subgroup SO{d —1,1) leaving 
the point of tangency invariant. 

In more general terms |167] . one has a principal bundle over a base manifold Ai 
with gauge group G. Locally, a connection on this bundle can be viewed as a one- 
form taking values in the Lie algebra g. One then considers the projection of the 
connection to its g/f) part and demands that at any point in the manifold, this be an 
isomorphism between the tangent space at this point and the algebra g/f). In other 
words, the g/f) part written as a matrix, has to have full rank. This of course 
means that the dimension of g/f) must be equal to the dimension of the manifold. 
The gauge field p°'{x) now comes in because the projection of the connection to its 
g/() part is not canonical; for a given subgroup H G G viewed as the stabiliser of a 
particular point in G/H, one could equally well take a conjugate group gHg^^ ~ H 
stabilising a different point in G/H. This will become more explicit shortly when we 
consider the MacDowell-Mansouri action. 

The procedure does not depend on the choice of gauge group G, and would be 
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equally possible if the gauge group were chosen to be the Poincare group E{d — 1,1); 
such a formulation for gravity, including copuling to matter, is indeed discussed in 
|75j . In that case, the homogeneous space G/H would be Minkowski space, viewed as 
an affine space. As the value of the gauge field p°'{x) at a particular spacetime point 
picks an origin in this affine space, it may then be identified with the vector space 
j^^ ^ given point in Minkowski space, the tangent space to this point can 
be identified with Minkowski space itself. Although all calculations we will present 
go through in the case of E{d — 1,1) as gauge group (resulting in different formulae 
of course), the formalism may seem a bit redundant since there is no real necessity 
for the second vielbein field. In the more general case of a homogeneous space G/H 
which acts as a "tangent space" instead of a fiat vector space, it is however necessary. 

We consider a theory with gauge group SO{d, 1), so that the connection A takes 
values in the Lie algebra 5o{d, 1). It can be split as (again introducing a length / on 
dimensional grounds) 

/ \ 



A 



a 1 i 

00 b ye 



(3.4.1) 



so that oo"-b acts as the usual so((i — 1, l)-valued connection of general relativity and 
e* as a vielbein one-form. We have simultaneously unified the usual connection and 
the vielbein, and replaced the (fiat) tangent space by a curved "internal" space, such 
that the de Sitter group and not the Poincare group now appears as a gauge group. 
(Lorentz) indices on u"'b and e* are raised and lowered using rj"'''. 

A gauge transformation, i.e. a local transformation g{x) taking values in the de 
Sitter group, can be split as g{x) = s{x)A{x), where s{x) changes the zero section, 
i.e. changes the local identification of points of tangency at each spacetime point, and 
A(x) is a local Lorentz transformation in the vielbein formalism of general relativity 
which does not mix the u"'b and e* parts of the connection. In this notation, the 
connection transforms under a gauge transformation as 

A{x) A'{x) = A'^{x)s''^{x)A{x)s{x)A{x)+A^^{x)s'''{x)ds{x)A{x)+A^^{x)dA{x) . 

(3.4.2) 

One can use this equation to relate the connection Aq corresponding to the triv- 
ial zero section, where the point of tangency is the origin of the internal space at 
each spacetime point, p°'{x) = (0, 0, 0, 0), to a connection corresponding to any given 
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zero section. The physical significance of this is the following. Assume we have 
fixed = (0,0,0,0). Then an action can be defined from the curvature of the 

connection A (here R is the curvature of the so{d — 1, 1) part of A), 



F = dA + AAA 



( \ 

R\ -pie" A eb) \T' = } {de' + uj'^ A 



(3.4.3) 



In four dimensions, the MacDowell-Mansouri action |110[ 1166] is 

■' = -3^1 (^°' = -32IJA / r-'^'-'-C^?? . (3.4.4) 

where the Latin indices run from 1 to 4, and so one projects F to its so(3, 1) part in 
this action. 

Apart from a topological Gauss-Bonnet term, the action fl3.4.4p is equivalent to 
the Einstein-Hilbert action with a cosmological term 

S = Y^l, I ea,c, (e" A A R^' - i^e» A A A e^) , (3.4.5) 
where we have to identify 

A = ^ (3.4.6) 

as the cosmological constant 

In order to define the projection of F in the action fl3.4.4p . one has used a splitting 

so{d, 1) ~ 5o{d, l)/5o{d - 1, 1) © 5o{d -1,1), (3.4.7) 

which depends on the gauge field since the subgroup SO{d — 1, 1) leaving a given 
point in de Sitter space invariant depends on the choice of this point. 

When the action fl3.4.4p is coupled to matter, the so{d, l)/so{d — l, 1) part e" of the 
connection appears in a volume element in the matter Lagrangian. By varying the 
action one obtains the field equations of Einstein-Cartan theory with a cosmological 
constant A = 3//^. The length scale /, which is so far arbitrary, can be chosen 
to reproduce the A of the observed universe, which means it must be chosen to be 
very large (the "cosmological constant problem"). By the field equations, one can 



"Note that in the formulation in terms of the Poincare group, with connection (j3.1.3p . no cosmo- 
logical constant appears and the length scale L remains undetermined. 
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determine for a given matter distribution a connection Aq consisting of an 5o{d — 1, 1) 
connection {u°'b)o and a vielbein Cq. 

The MacDowell-Mansouri action reproducing Einstein-Cartan theory with a cos- 
mological constant includes a gauge choice. We can hence view it as the gauge-fixed 
version of a more general theory. Since fl3.4.2p determines how the connection trans- 
forms under a gauge transformation, we can generalise a given solution of Einstein- 
Cartan theory to an arbitrary gauge choice. The extension of the theory to arbitrary 
configurations of the gauge field, and hence arbitrary choices of tangency points of the 
internal space to spacetime, is what we call Einstein-Cartan-Stelle-West theory. 
Any solution of Einstein-Cartan theory, in particular any (torsion-free) solution of 
general relativity, gives rise to more general solutions of Einstein-Cartan-Stelle-West 
theory via fl3.4.2p . We will later see that one can construct an so((i — 1, 1) connection 
with torsion from a torsion-free one. 

In (13.4.21) . s{x) takes values in the de Sitter group, more precisely in the subgroup 
generated by "translations" which leaves no point of de Sitter space invariant. The 
correspondence between Beltrami coordinates p"-{x) on de Sitter space and such group 
elements is given explicitly by 



s{p{x)) = exp 



:Artanh V^^^Ml^Ux, 



(3.4.8) 



Then the group element s{p{x)) maps (0,0,0,0) to {p^{x),p'^{x),p^{x),p'^{x)) in Bel- 
trami coordinates. A different choice of coordinates in the internal de Sitter space 
would correspond to a different parametrisation of the elements of the subgroup of 
translations of the de Sitter group. 

Inserting (13.4.81) into (I3.4.2P and setting A(x) = e, we obtain 

= + f 1 - + + U" - (a ^ 6) , 



lK-f{p) V lip) J P-P 2 

In f iPcdp" 

I P 

p ■ p + n'^ \ p ■ p 



^\pi^)) = z—r2 ( P -^i^ - 7(P)) + dp'lip) + {uj\),p'i{p) 



l + ^(l-7(p)) ei 

+P'elpa '-'^ ^ + ^ , (3.4.9) 

p ■ p + K"^ 7(pj 



where 



p ■ p + p ■ p 



Because p-p > — in Beltrami coordinates, the square root is always real. In the 
limit p - p — )■ 0, our parametrisation is the same as that used in |153j . and we recover 
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their results 

u^'^Mx)) = (^l^of + ^P^'el + ^ {p^dp" + c^oVpc)) -{a^b), 
e\pix)) = ei + ^ (^-^p'pcdp^^ + dp' + + ^P'^oPa ■ (3.4.11) 

Near p = 0, we have 

uj''\p{x)) = u'o' + 0(-] , e'ip{x)) = + -dp' + O (^) . (3.4.12) 

As mentioned above, the 5o{d, l)/so{d — 1, 1) part of the connection A acts as a 
vielbein and maps vectors in the tangent space at a point x in spacetime to vectors 
in the tangent space at p{x) in the internal de Sitter space, given in components 
with respect to an orthonormal basis at p{x). In order to give their components in 
the coordinate- induced basis {^}, we need another vielbein, which can be obtained 
from fl3.4.9p by setting uq = Cq = (corresponding to spacetime being de Sitter space 
with cosmological constant A) and p°'{x) = yx", as in |153] . We obtain 

C(pM) = .- '"°<P-f''P'-''°P"'y-^' . (3.4.13) 

{p-p){p-p + K,^) 

where n is a coordinate index in the internal space and a denotes a Lorentz index, as 
before. This vielbein is of course independent of the underlying spacetime. 

Parallel transport can be defined for the so((i, 1) connection using the notion of 
development, which generalises the usual covariant derivative. One introduces a de- 
velopment operator |153j 

D = d-]^uj'''Mab-{e-V), (3.4.14) 

where the second term is the usual infinitesimal relative rotation of tangent spaces at 
different spacetime points, and the last term compensates for the change of point of 
tangency and hence generates maps from the tangent space at one point of de Sitter 
space to the tangent space at a different point of de Sitter space. Again one should 
think of an internal space rolled along spacetime |166j . 
In components, in our conventions we have 

{uj''''Mai,y^ = -2uj'd, (3.4.15) 



and the combination e"\4 acts on Lorentz indices as an element of 5o{d — 1, 1), rep- 
resenting the map from one tangent space to another in the respective bases. We use 
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the result obtained by |153j using the techniques of non-hnear reahsations, namely 
that when expressed as an so (rf — 1, 1) matrix, 

l{e ■ V) = Ks{p)-\e'^Xa)s{p) - s{p)-' [s{p + 6p) - s{p)] , (3.4.16) 

where s{p) is defined according to fl3.4.8p and 6p is determined from the equation 

[s{p + Sp)r, = [{l + e'X,^)s{p)]\ (3.4.17) 

where only terms linear in e° are kept in 6p. An explicit calculation shows that 

5p" = -(r/6cey) + e'^«, (3.4.18) 

K 

and hence near p = 0, we have dp"' = KjC , clS expected. We find that (e ■ V) has 
components 

(e . V)\ = ^(^Vc- e /)(l-7(p)) _ ^3_4_,g) 
l{p ■ p) 

One then has a notion of holonomy, mapping closed loops in spacetime into the 
internal space by development. In particular, if one develops the field p{x) describing 
the point of tangency around an infinitesimal closed loop at Xq, the developed value 
will in general differ from the original value at xq |153j : 

Ap"(xo) (X T^u\xo)l\{p{xo)) jx^dx" , (3.4.20) 

where l°'i{p{x)) is the inverse of the vielbein fl3.4.13p and T^j,* are the components of 
the torsion tensor T = de + u A e. This is because by specifying p"(x), one locally 
identifies the "internal" de Sitter spaces, thinking of a tangent de Sitter space rolled 
along the manifold. In a generic situation, where the manifold A4 is not identical to 
the internal space, such an identification is, strictly speaking, only possible at a point. 
To understand the geometry, one may picture a sphere being rolled along a plane, as 



in figure 13.21 In general, when the sphere is rolled around and returns to its initial 
position on the plane, its point of tangency will not be the original one. This is just 
the situation for the Lorentzian analogues, Minkowski space and de Sitter space, as 
we will discuss shortly. 

The central result we will try to justify in the following is that, starting from 
Minkowski spacetime, if we assume the internal de Sitter space is rolled along Minkowski 
space in a non-trivial way, we obtain a connection with torsion. In our interpretation, 
this is the only way that "coordinates" can act as translations on momentum space, 
as one normally assumes when associating the Snyder algebra with a noncommutative 
spacetime. 



Deformed General Relativity and Torsion 67 



Figure 3.2: When the curved internal space is rolled along Minkowski space, a path in 
spacetime corresponds to a path in the internal space. Because of the curvature of the 
internal space, a closed path in Minkowski space does not correspond to a closed path in 
the internal space, which is manifest as torsion. (We have drawn Riemannian spaces, a 
plane and a sphere, for clarity of presentation.) 

3.5 Cartan Connections on Homogeneous Bundles 

This more mathematical introduction into Cartan connections on homogeneous bun- 
dles relies mainly on |166] . but mentions some additional points which are of impor- 
tance to our discussion of Einstein-Cartan-Stelle-West theory. 

The tangent bundle of a manifold needs to be replaced by a fibre bundle whose 
fibres are homogeneous spaces T> = SO{d,l)/ SO{d — 1,1). This can be achieved 
by starting with a principal bundle P{M, SO{d, 1)), and considering the associated 
bundle V = E{Ai,V, SO{d,l), P) = P y<so{d,i) (taken as phase space); it can be 
identified with P/SO{d — 1, 1) by the map 

ly-.V ^ P/SO{d- 1,1), [u,a- SO{d- 1,1)]^ ua- SO{d- 1,1). (3.5.1) 

Then the structure group SO{d, 1) is reducible to SO{d — 1, 1) if the associated 
bundle V admits a cross section a : M. ^ V [99]; furthermore, there is a one-to-one 
correspondence between reductions of the structure group and cross sections. This 
cross section, called a zero section in [135] , corresponds to a choice of origin in the 
momentum space attached to each point. In physicist's terms, the de Sitter group is 
spontaneously broken down to the Lorentz group by the choice of points of tangency 
in the tangent de Sitter spaces at each spacetime point. 

The bundle reduction depends on the choice of zero section, or rather, its local 
representation in coordinates as a function Ai D U ^ U x V. This is because the 
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embedding of S0{d — 1, 1) into SO{d, 1) is not canonical, as the stabilisers of different 
points in V are isomorphic but related by conjugation. In other words, the mappings 
appearing in the exact sequence 

^ SO{d -1,1)^ SO{d, 1) ^ SO{d, 1)/S0{d - 1, 1) ^ (3.5.2) 

are not canonically chosen (cf. the discussion for the affine group in |135] ). 

It is of course possible to choose canonical coordinates such that the function 
representing the zero section is just x i— )■ (x, [e]) = {x,SO{d — 1,1)) G x V. 
However, in general we want to locally identify the fibres at nearby base space points, 
adopting the viewpoint that there is a single tangent V space which is "rolled along" 
the manifold. Then we need to retain the general coordinate freedom. (This point 
is missing in the discussion of |166] .) An exact identification is only possible when 
the connection is flat. Let us assume that coordinates on V have been fixed, and 
that it is the zero section, and hence the identification of the fibres, that is variec|**. 
After a choice of zero section, there is still a local gauge freedom corresponding to the 
stabihser SO{d — 1, 1). We express a given section as s(x), where a{x) = (x, s(x)) G 

xV in our coordinates. The section that corresponds to Sq{x) = [e] will be called 
"trivial" . 

An so{d, l)-valued Ehresmann connection A in P is in general not reducible to an 
5o{d — l, l)-valued connection in the reduced 5*0(^ — 1, 1) bundle Pr{J^, S0{d — 1, 1)). 
It can, however, be pulled back using the inclusion 

i^: SO{d- 1,1) ^ SO{d,l), A ^ s{x)As{x)-^ (3.5.3) 

to a Cartan connection A^^ on the reduced bundljll]. Of course reducing the connec- 
tion to an so (d — 1, l)-valued connection and pulling it back to a Cartan connection 
are very different operations, since in the latter case one wants the so{d, l)/so((i— 1, 1) 
part of the pulled-back connection to act as a soldering form, so in particular to be 
non-singular. We obtain a bundle sequence (cf. |166] ) 

**one is free to choose an "active" or "passive" viewpoint here 

^^We assume here that the necessary condition ker A n {ix)*(TP]i(Ai, SO{d — 1, 1))) = {0} (see 
|148j ) is satisfied. 
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Pr{M, SO{d-l, 1)) P{M, SO{d, 1)) >- P/L^{SO{d - 1, 1)) ~ V 




M 

The reduced bundle Pr{M, SO{d - 1,1)) is mapped into P{M, SO{d, 1)) by 

p^[p,e] = {{pA-\ s{x) As{x)-^)\A e SO{d- 1,1)} 

e PniM, SO{d - 1, 1)) x,^(5o(d-i,i)) SO{d, 1) . (3.5.4) 

The connection one-form A on Ai, induced by the connection A on P, depends on 
a choice of section r : — )■ P{A4., SO{d, 1)). If the zero section a is fixed, one can 
choose an arbitrary (local) section r/j : — )■ Pr(A^, 5*0(^ — 1, 1)) to obtain a section 
r; in local coordinates, 

a{x) = {x,s{x)), tr{x) = ix,A{x)) 

— > r(x) = {x, s{x)A(x)s{x)~'^ ■ s{x)) = (x, s{x)A(x)) . (3.5.5) 

For practical computations, it is often useful to first consider the trivial section. The 
induced connection corresponding to this section, denoted by Ao{x), is related to the 
connection for a general section by 

A{t{x)) = A~^{x)s'^{x)Ao{x)s{x)A{x) + A~^{x)s'^{x)ds{x)A{x) + A~^{x)dA{x) . 

(3.5.6) 

Once the zero section s{x) has been fixed, there is still the freedom of SO{d —1,1) 
transformations, corresponding to different choices of A(x) in 03.5.61) . These are the 
standard local Lorentz transformations in the vielbein formalism of general relativity. 

The choice of zero section induces a local splitting of the so{d, 1) connection, 
according to 

so{d, 1) ~ 5o{d, l)/so{d - 1, 1) © so{d -1,1). (3.5.7) 

This splitting is invariant under the adjoint action of SO{d —1,1), thus the different 
parts of the connection will not mix under SO{d — 1, 1) transformations. We are 
considering a reductive geometry (cf. fll.l.6p ): recall that the existence of such a 
splitting 

= y©l) (3.5.8) 
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invariant under the adjoint action of iJ, implies that we may identify g/f) with the 
subspace y of g. In the present case, this is of course the subspace spanned by the 
generators M^+i ^ of SO{d, 1). For the de Sitter group, just as for the Poincare and 
anti de Sitter groups, one can say more: The Lie algebra 5o{d, 1) is equipped with a 
Z2 grading; the subspaces in fl3.5.8p satisfy 

[f),f)]C(), [f),y]Cr, [y,y]Cf) (3.5.9) 

and the homogeneous space G/H is a symmetric space |167] . 

The fact that de Sitter space viewed as a homogeneous space is a symmetric space 
is directly responsible for the splitting of the curvature F of the Cartan connection 
A into curvature and torsion of the 5o{d — 1,1) connection u that we have observed 
above |167] . A general Cartan connection A = u + e, where u takes values in f) and 
e takes values in y, has curvature 

F = R+^[e,e] + d^e, (3.5.10) 

where [e, e] denotes taking the wedge product on the form part and the commutator 
on the Lie algebra part, i.e. 

h = {e^ A e'^)^^ = C,\ [e^/^ - eje^) (3.5.11) 

if Cj' y. are the structure constants as in (11. 1.41) . The middle term generically takes 
values in [) and y, so that one has a splitting 

= i? + ^[e,e],, = ^[e,e]j + c/^e. (3.5.12) 

For a symmetric space, however, [e, ejy vanishes |l67j . and we obtain the result used 
in the construction of the MacDowell-Mansouri action (I3.4.4p . 

Because we assume A to be a Cartan connection, the so(c/, l)/so((i — 1, 1) part acts 
as a soldering form, corresponding to the standard vielbein of general relativity; in 
particular, is an invertible matrix. The soldering form maps vectors in the tangent 
space T^M. at a point x in spacetime to vectors in the tangent space Tp(^x)V at p{x) 
in the internal de Sitter space, given in components with respect to an orthonormal 
basis at p{x). The vielbein that maps between the components of a vector in the 
orthonormal basis and the coordinate-induced basis is given in (13.4.13p . 



Deformed General Relativity and Torsion 



71 



3.6 Synthesis 



The notion of development along curves in spacetime is central to the interpretation 
of Einstein-Cartan-Stelle-West theory, because it allows "spacetime coordinates" to 
act as translations in the internal de Sitter space. The situation described by DSR, 
where noncommuting translations on a curved momentum space are interpreted as 
noncommuting spacetime coordinates, here corresponds to a Minkowski spacetime 
with an internal de Sitter space rolled along this Minkowski space. The gauge field 
specifies the points of tangency of the internal space at each spacetime point, 
and we have chosen Beltrami coordinates on de Sitter space which look like Cartesian 
coordinates on Minkowski space near the "origin" of de Sitter space. Since the internal 
space has a natural scale k and we needed to introduce a natural scale / in spacetime, 
we choose the gauge field to be 



in a vicinity of the origin of spacetime which is now taken to be Minkowski space, 
where are the standard Minkowski coordinates such that the connection vanishes 
in general relativity. In general a closed path in spacetime will not correspond to a 
closed path traced out on the internal space, hence such an identification is only local 
and, strictly speaking, only valid the origin of Minkowski spacetime. On dimensional 
grounds, the effects of torsion scale as j or ^. For f l3.6.ip to be well-defined, we must 
guarantee that x ■ x > — Z^, so / should be large in Planck units. We will comment on 
the significance of the scale / at the end of this section. 

It should perhaps be emphasised that the gauge field p°'{x) does not represent 
physical momentum, but determines the point of tangency of the internal space we 
have introduced which is to some extent arbitrary. Tangent vectors to the original 
spacetime can be mapped to tangent vectors to the internal space via the vielbein. The 
physical interpretation of motion in an internal "momentum" space which is related 
to motion in spacetime seems obscure, but if coordinates are to act as translations 
in the internal space, the two must be connected in some way. In this sense, we are 
constructing the minimal non-trivial gauge field which leads to observable effects, and 
an alternative interpretation of noncommuting generators Xa in the Snyder algebra. 

In our interpretation, different points in the internal de Sitter space do not repre- 
sent different values for physical four-momentum. Hence we avoid problems with the 
physical interpretation of DSR, such as the "spectator problem" of noncommutative 
momentum addition and the "soccer ball problem" of how to describe extended ob- 
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jects, given that any momentum would appear bounded by the relation p-p > — k^. In 
our framework, tangent vectors representing a particle's (or extended body's) velocity 
remain vectors and as such live in an unbounded space with commutative addition. 

As explained before, we can use equations fl3.4.9p to obtain the connection com- 
ponents uj and e corresponding to this choice of our gauge field; we set = and 
(e/x°)o = and substitute (13.6.11) to get 



/' - (xV-^V) ^-;.;'^^!^i-^ . (3.6.2) 



= 7 ^7 -^[x'x^{x-x-2l\-i{x)-l)) + 5;{x-x)2l'-i{x) 

[x ■ x)[x ■ X + r) V 

and 

/2(2/2(7(a;) - 1) - 3(x ■ x)) - {x ■ xf 



d e ^ - d e ' = (x 5 ' - X 5 



[x ■ x)[x ■ X + P) 



2 



^b ^b i2P+X-x) ix-X + Pj^ix)-!)) , 

ujy e^fe-w^ = 7 TT , ,ox,n , . [XuOf, -x^^Oy ) , (3.6.3) 

[x ■ x)[x ■ X + P)P'^[x) V / 



where now 



which gives a non-zero torsion 



(3.6.4) 



7;^' = {xj; - Xf,8^') ^^^L^^ ■ (3.6.5) 

Interestingly enough, for the choice of zero section (I3.6.ip the scale n drops out of all 
expressions. Expressed in coordinates on the internal space, one has 

T,.^ = - r^WTf ■ ^3.6.6) 

The quantity T^^/* will be multiplied by an infinitesimal closed loop § x^dx" to give 
the difference in the value p{x) caused by development along this loop. In momentum 
coordinates, this is equal to ^ § p^dp", and the effect of going around the developed 
curve in the internal space is (near a; = or p = 0) proportional to k"^, just as was 
suggested by (I3.2.2p . 

Expressing Minkowski space in the usual coordinates, together with the (local) 
identification p"(x) = jx", in this framework gives a connection with torsion. De- 
veloping a closed curve in spacetime in the internal space will give a curve that does 
not close in general, which is the effect of noncommuting translations in the internal 
space. 
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The reader may wonder how the "deformation" of the Minkowski solution de- 
scribed here is manifest in a metric. We can define a metric by the usual expression 

g^iv = (^l^tVab- (3.6.7) 

This metric would not determine the connection, but could be used to define distances 
in the spacetime in the usual way. Then, from fl3.6.2p . we get 

4 {x ■ x) 

dfiu = Vfj.f -I I x-x + ^fj.^v— X , ,r,.2 ■ (3.6.8) 
t + — {[X ■ X) + r) 

It should be stressed that the connection on spacetime is not the Levi-Civita con- 
nection of this metric, as we are working in a first order formulation where metric 
and connection are independent. There is a factor of 4 because of a term in (13.4. 12p 
which does not necessarily go to zero as p — )■ 0. With the identification f l3.6.ip . the 
soldering form always gets a contribution 

e^\x) = ie,% + 5; + 0(j) . (3.6.9) 



The limit k — )■ oo is now identified with the limit / — )■ oo, in which we recover the 
(rescaled) Minkowski metric. 

In deriving the expressions fl3.6.2p we started with Minkowski space, which clearly 
solves the field equations of the Einstein-Cartan theory for an energy-momentum 
tensor cancelling the cosmological constant term, and vanishing internal spin. In 
changing the zero section, we then performed a SO{d, 1) gauge transformation, under 
which the curvature F transformed as 

F{s{x)) = s~\x)F{x)s{x). (3.6.10) 

This is a general SO{d, 1) rotation which mixes up the so{d — 1, 1) and so{d, l)/so((i — 
1, 1) parts of the connection and the curvature. Hence, the resulting connection will 
no longer solve the original field equations, but the field equations for an energy- 
momentum tensor which has also undergone a SO{d, 1) transformation. This mixes 
up the energy-momentum and internal spin parts, combining them into an element 
of the Lie algebra so{d, 1), the interpretation of which seems obscure at least. 

A comment is in order with regard to physical units. In addition to the energy 
scale K, which is perhaps naturally identified with the Planck scale, the identification 
of lengths with momenta, necessary in the framework presented here, requires the 
choice of a unit of length I which is not necessarily connected to the scale k. It may 
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well be that it is instead the cosmological constant which sets this length scale, leading 
to an astronomical scale instead of a sub-atomic one. And indeed, some more recent 
approaches to quantum gravity {e.g. [SB]) use the product GA as a dimensionless 
parameter in a perturbative expansion. A fixed positive A also seems to be required in 
non-perturbative approaches to quantum gravity Then the cosmological constant 
may play the role of a fundamental parameter in quantum gravity. 

3.7 Summary and Discussion 

It has been argued that the algebra of DSR describes the symmetries of a semiclas- 
sical limit of (a generic theory of) quantum gravity (see e.g. [6]). If this claim is 
taken seriously, one has to give an interpretation of the noncommuting translations 
appearing in the algebra, and usually they are supposed to represent a spacetime 
with a fundamentally noncommutative structure [111] . Alternatively, one may view 
the apparent noncommutativity as an artefact of the finite resolution of lengths [TO] . 
However, there are fundamental difficulties in associating these noncommuting op- 
erators directly with coordinates on spacetime, as position is not additive in a way 
that momentum and angular momentum are Furthermore, as also pointed out 
in [27|, a proposed noncommutativity of spacetime of the form (13.2.21) . proportional 
to angular momentum or boost generators, and hence vanishing at a given "origin" , 
seems deeply at odds with any idea of (even Galilean) relativity. This would also 
be an obvious criticism of the framework presented in this chapter, when taken as a 
theory that is supposed to describe the real world. 

What we have shown here, is that using the framework of Einstein-Cartan-Stelle- 
West theory, one reaches a different conclusion from the usual one: The noncommu- 
tativity of translations on a momentum space of constant curvature is interpreted as 
torsion of a connection that solves the equations of Einstein-Cartan theory with a 
modified energy-momentum tensor that mixes with the spin tensor. If one takes this 
seriously, one is led to conclude that there is an effect of torsion induced by quantum 
gravity, whose effects would however only become measurable over distances compa- 
rable to /, a length scale presumably associated with the cosmological constant. 

No such effect appears in de Sitter space with an appropriate cosmological con- 
stant, or indeed any vacuum solution of the theory. Vacuum solutions are then just 
described by the Poincare algebra, and hence undeformed special relativity. It is curi- 
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ous that the curvature in the internal "momentum" space is reflected in torsion when 
Minkowski spacetime is considered as spacetime, which would however seem rather 
unnatural as it is not a vacuum solution any more. This result does however seem 
to be the most natural interpretation of the DSR algebra in a differential-geometric 
framework capable of describing more general spacetimes. 

Any non-zero energy-momentum tensor, however, will lead to a connection having 
torsion. In theories such as Einstein-Cartan theory, this leads to well-known problems 
when trying to couple the gravitational field to Maxwell fields, for instance, as there 
is no unambiguous procedure of minimal coupling. This is because the statement 
that the exterior derivative is independent of the choice of connection, 

(dA)^^ cc d[^A^] = V[^A^] , (3.7.1) 

is true precisely when torsion vanishes. Using an so{d — 1, 1) connection, this is 
apparent from 

d{e'Ai) = V(eMi) = AiVe' - e' A VAi = -e* A VAi + AiT (3.7.2) 

where Ve* = de^ + u^j A etc. One has two different candidates for the field strength 
F, namely e* A VAi and d{e^Ai), with possibly observable differences between these 
choices, although it could be argued that F = dA is the only meaningful choice 
because it preserves gauge invariance [21] . 

In the framework of Einstein-Cartan-Stelle-West theory, gauge fields should be 
coupled to gravity via development, i.e. replacing F = dA by F = DA. We compute 
from fl3.4.9p and (13.4. 19p that development can be expressed in terms of uq and cq by 

hip) - 1) 

D = d + uj-{e-V) = d + uJo + 2{p0A eo)K ^;y , =: d + ooes , (3.7.3) 

l{p-p} 

where ®^ is an antisymmetrised tensor product, (U ®yi V")"'' = U^'^V'^^ = ^{U^V'' — 
Ijbyay Parallel transport is effectively described by the connection Ues, whose torsion 
is in general non-zero. One can give an explicit formula for the torsion which is 
however rather complicated and does not seem to give much insight; to linear order 
in p\ one has 

r = -{R\),p' - -^e^ A (ey ) + {p'e.o A dpi - p,el A dp^) + 0{p'). (3.7.4) 

If we assume a universal relation of internal momenta and spacetime lengths of the 
formp ~ yx, the second and third terms seem to give contributions of order x/t^. The 
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first term is proportional to the local curvature of ujq, R% — du!%+(x!^jAuj^b, contracted 
with x^. Note that it is the Riemann tensor, not the Ricci tensor, that appears, so 
that propagating degrees of freedom of the gravitational field are included. This first 

term should in realistic situations, even in vacuum, give the dominant contribution. 

Assuming that minimal coupling is achieved through the development operator D, 
or equivalently by using the efTectivc connection which has torsion, one would couple 
vector or matter fields (using Dip for spinors) to torsion, breaking gauge invariance. 
Such an effect of course leads to the absence of charge conservation, and this should 
be experimentally observable in the presence of a non-trivial gravitational field, i.e. 
in regions where spacetime is not exactly de Sitter. Let us recall that in standard 
tensor calculus one uses the identity 

^u]Mxp = R^.x^'M,, + R^,/M,x - T^\V„M^p (3.7.5) 

which gives for an antisymmetric Mxp when contracted 

[V\ W]Mxp = -g'^V'T^^^aMxp, (3.7.6) 

to estabhsh that the right-hand side of Maxwell's equation V^F^p = 47r Jp satisfies a 
continuity equation in the absence of torsion. With torsion present, one has then for 
any region R 

[ d^xVhn^Jx [ d^x^ {-9''VT,%V,Fx,) . (3.7.7) 

JdR ^'^ Jr 

Effects become important when the size of the region R is comparable to the length 
scale of torsion. 

As an example consider the Schwarzschild solution, which has Kretschmann scalar 

RabcdR"'""' ~ ^, (3.7.8) 

so roughly Rabcd ~ rsr^^. Assuming that the origin of the x coordinate system 
corresponds to the centre of the Earth, we would, on the surface of the Earth, measure 

a torsion of order rsR^^, where R is the radius of the Earth. With rs ~ 10~^ m and 
R^ ~ 10^^ m^, the length scale for effects of torsion would be about 10^^ m. The 
other two contributions, given that / ~ 10^^ m, would be much smaller. Although 
this crude estimate suggests that effects will be very small, even tiny violations of 
charge conservation should have been observed experimentally. For a discussion of 
experimental tests of charge conservation and possible extensions of Maxwell theory 
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in Minkowski space, see |102] . Processes such as electron decay on a length scale of 
10^^ m, or a time scale of 10'^ s, can clearly be ruled out. 

The example presented here shows that the correct physical interpretation of 
purely algebraic relations, such as the commutators of the Snyder algebra, may not 
be the seemingly obvious one. We conclude that the physical motivation for assuming 
spacetime is "noncommutative" may not be as clear as often assumed. 

3.8 Gauge Invariance Broken? 

When discussing the issue of possible consequences of broken gauge invariance, we 
must bear several points in mind. 

The idea that an asymmetry between the proton and electron charges could have 
interesting astrophysical consequences goes back to Lyttleton and Bondi |108] . who 
argued that a charge difference, and hence a net charge of the hydrogen atom, of 
10~^^ elementary charges, might explain the observed expansion of the universe by 
electrostatic repulsion. This idea was proposed in connection with Hoyle's ideas 
of a universe in a steady state, which required continuous production of material 
via a "creation field" [S5], and a modification of Maxwell's equations was proposed 
to accommodate charge nonconservation. From Hoyle's perspective, however, the 
steady state model was incompatible with expansion of the universe by electrostatic 
repulsion, and should lead to electrostatic attraction instead [86] . 

There seems to be no need for the electron and proton charges to be of equal 
magnitude to maintain gauge invariance. However, if the universe as a whole is not 
neutral, but it is homogeneous, gauge invariance must be broken. Hence the two 
issues are closely related. Modern laboratory experiments |115] give a bound of 10^^^ 
elementary charges on the difference of electron and proton charge; astrophysical 
considerations give bounds of 10~^^ elementary charges using the isotropy of the 
cosmic microwave background [30j, or 10~^^ elementary charges by considering cosmic 
rays |124] . Recently, an interesting proposal to measure net charges of atoms and 
neutrons, sensitive to 10~^^ elementary charges, was put forward fS]. 

From a theoretical viewpoint, if gauge invariance is broken, it is natural to assume 
a nonvanishing photon mass. One then considers Einstein-Proca theory, an outline 
of which can be found in [79]. The photon may also be charged. Here, experimental 
bounds on the charge are 10~^^ elementary charges using pulsars [141] . and possibly 
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10~^^ elementary charges from CMB isotropy [30] . 

Experimental bounds on violations of gauge invariance in electrodynamics are very 
tight, and hence any theory predicting torsion which is coupled to electromagnetism 
faces severe problems when confronted by experiment. In the framework of Einstein- 
Cartan-Stelle-West theory, it is possible to maintain gauge invariance by choosing 
F = dA, but using the development operator is the most natural choice. 



3.9 Symmetric AfRne Theory 



If Einstein-Cartan theory is considered as the extension of general relativity which 
allows for torsion, there is an analogous extension which allows for a non-metric con- 
nection. This theory can be formulated in terms of a torsion-free Qi{n, M) connection 
and is known as symmetric afRne theory. It is equivalent to standard general 
relativity with a massive vector field, known as (nonlinear) Einstein-Proca theory 



One could attempt to embed this theory into a theory of a connection taking 



values in the algebra of the affine group a[n 

I 

A = 



\ 



1 i 



\ 



(3.9.1) 



0/ 



where now uj°'b is not constrained by u 



ab 



—CO 



ba 



Geometrically, this means that the 



connection does not preserve the lengths of vectors under parallel transport. 
The corresponding curvature of A {R is the curvature of the fll(n, M) part). 



F = dA + AAA 



\ 



Da 



V 







(3.9.2) 



would then be constrained by demanding that = 0. This seems rather unnatural 
from the perspective of Cartan geometry. Furthermore, the length scale / is now 
completely arbitrary as it does not appear in the Ql{n, M) part of the curvature any 
more, just as for a connection taking values in the Poincare algebra. 



MPor a comprehensive review of general theories of this type, see |78] . 



Deformed General Relativity and Torsion 



79 



One proceeds by considering Lagrangians that only depend on the Ricci tensor, 
which is a one-form obtained by contracting the components of the Riemann curva- 
ture, written in the basis of one- forms given by the vielbein e*: 

RiCa = RiCj„e', RiCi„ = Rji\, (3.9.3) 

where the curvature two-form is 

R\ = lRij\e' Ae^. (3.9.4) 

One then sphts the Ricci tensor into symmetric and antisymmetric part, symmetrising 
over a component (with respect to the given basis) index and a Ql{n, R) index. The 
antisymmetric part can be interpreted as a spacetime two-form 

ie^{R\Ae'), (3.9.5) 

where i^^ is interior multiphcation with the vector Ca, defined by being dual to the 
one- forms e**: 

e\ea) = 5\. (3.9.6) 

No such construction is possible for the symmetric part, which is normally more 
relevant in concrete constructions. The splitting itself seems depend on the choice of 
basis. 
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Chapter 4 

General Relativity From a 
Constrained Topological Theory 



4.1 Overview 

In this chapter, we shall investigate another possible formulation of general relativity 
in four dimensions, in terms of a constrained topological theory. Here the term 
"topological" means that the theory has a very large symmetry: All of its solutions 
are locally gauge equivalent, and there are no local degrees of freedom. Globally, 
not all solutions are necessarily gauge equivalent if there are topological obstructions. 
Starting from such a theory, we will use constraints that restrict the possible gauge 
transformations and hence allow for a less trivial theory, namely one with local degrees 
of freedom that are those of general relativity. 

The topological theory we consider is usually known as BF theory, deriving its 
name from the action 



where F is the curvature of a G-connection A and 5 is a g-valued two-form (in our 
case G will be a rotation group). The equations of motion are 



"What, then, is time? If no one ask me, I know; if I wish to 
explain to him who asks, I know not. " flT^ 



St. Augustine 




(4.1.1) 



F' 



tab 







VB 



ab 



0, 
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where V is a covariant exterior derivative involving the connection A. All flat con- 
nections are locally gauge equivalent, and the (local) equivalence of all solutions to 
the second equation can be seen [LSI by noting that the transformation 

^ab ^ ^ab ^^^ab ^ (4.1.3) 

where rj"''^ is a g- valued one- form, is a symmetry of the action fl4.1.ip . This transfor- 
mation is generated by the equation F"^ = 0, in a canonical sense, and should be 
viewed as a generalised gauge transformation. It is a symmetry on the phase space 
of the theory. Locally, any solution to VB""^ = is of the form Vr/'*'', so any solution 
is gauge equivalent to B"-^ = in this generalised sense. 

The motivation for considering BF theory is that general relativity in three di- 
mensions is just of this form: The first order formulation can be written as 




(4.1.4) 



and hence general relativity in three dimensions is topological. Since the quantisation 
of this system is relatively well understood (see [168] for Witten's formulation as 
a Chern-Simons theory, and the book [31] for an overview over different routes to 
quantisation), and seems to have many conceptual aspects such as diffeomorphism and 
Lorentz invariance in common with the higher- dimensional case, one might hope to 
gain understanding of higher- dimensional general relativity from studying BF theory. 
On the other hand, there is of course a fundamental difference between a system with 
an infinite number of (local) degrees of freedom and one with only (usually a finite 
number of) global degrees of freedom, which makes it rather unclear whether this 
approach will be fruitful. 

It has been shown [56] that in any number of dimensions d> A general relativity 
can be formulated as a constrained BF theory, where one adds to the BF action 
constraints quadratic in the field B"-^, restricting it to be a wedge product of one- 
forms that can be interpreted as a frame field. In four dimensions, where one has 
to enforce 

= e^'^de^ A e'^ (4.1.5) 

to reproduce the first order GR action (13.1.11) . this formulation (in slightly different 
complex form) was first studied by Plebanski |138j . This is also the situation most 
studied in the spin foam approach to quantum gravity, where one tries to quantise 
general relativity by quantising a BF-type theory and then implementing constraints. 
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In this chapter, we shall investigate a formulation which is in the spirit of the 
Plebanski formulation, but involving only linear constraints, of the type used re- 
cently as a new idea in the spin foam approach to quantum gravity. We identify 
both the continuum version of the linear simplicity constraints used in the quantum 
discrete context and a linear version of the quadratic volume constraints that are nec- 
essary to complete the reduction from the topological theory to gravity. We illustrate 
and discuss also the discrete counterpart of the same continuum linear constraints. 
Moreover, we show under which additional conditions the discrete volume constraints 
follow from the simplicity constraints, thus playing the role of secondary constraints. 

In the general context of the thesis, we see how a very large symmetry present 
in the original action f l4.1.ip can be constrained appropriately so as to give a theory 
with rather different physics - one with local degrees of freedom. This symmetry acts 
neither on spacetime nor on fibre bundles, but on a symplectic manifold, namely the 
phase space of a dynamical system. 

4.2 Actions for Gravity 

It should not be surprising that the equations of general relativity can be derived 
from several different action principles, leading to equivalent classical theories (in 
the case of pure gravity, at least). The statement that given equations of motion 
arise from different actions is the functional equivalent of saying that there are many 
functions with the same stationary points. Apart from constructing different actions 
as functionals of the same variables, one can also express the same physical content 
in terms of different variables (by performing a Legendre transformation, or other- 
wise), leading to even more possibilities for putting the same physics into a different 
mathematical form. 

This introduces several possible sources of ambiguity into quantisation; Feynman's 
path integral approach to quantum mechanics is based on the fact that off-shell con- 
figurations, i.e. those that do not solve the equations of motion and are not stationary 
points of the action, contribute to transition amplitudes as well. Hence different func- 
tionals of the same variables, while giving the same equations of motion, may give 
different quantum theories. For formulations in terms of different variables it is even 
less clear whether the resulting quantum theories are related or not. One may see this 
as a problem or as a virtue; when a classical theory such as general relativity seems 
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to resist conventional quantisation techniques, one may hope that one of the possi- 
ble classical reformulations offers more promise for quantisation. The most obvious 
example of this is Ashtekar new variables (TU] as a possible new route to quantis- 
ing general relativity in four dimensions. In three dimensions, a number of consistent 
quantisation schemes for general relativity are known, leading to (at least apparently) 
inequivalent theories [31] . 

But already classically, in the case of general relativity equivalent formulations 
for pure gravity can give non-equivalent theories when coupled to matter. We en- 
countered an instance of this in chapter [3] when discussing Einstein-Cartan theory; 
in contrast to conventional general relativity, where torsion is set to zero by fiat, in 
Einstein-Cartan theory matter can act as a source of torsion. This is a non-equivalence 
on the level of equations of motion which can therefore be tested more easily by ex- 
periment. In this chapter we shall consider a reformulation of pure general relativity 
only. 
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Figure 4.1: Lagrangians and Hamiltonians for general relativity in four dimensions. (Es- 
sentially taken from |131| .) 



In figure HT], essentially taken from the review [131] . we show a few possible action 
pr,„c,pi; fo7ge,>e.a. .e.afvity and the. i„ter.e.a"„ J The most commonly used 



'Figure 14.11 is by no means meant to be a complete display of all known actions for general 



General Relativity From a Constrained Topological Theory 



84 



ones are the Einstein-Hilbert action [82], the Palatini first order formulation, and its 
modification proposed by Hoist [83]. This last one is of special interest because it is 
the classical, covariant starting point for the canonical quantisation leading to loop 
quantum gravity ^145j . Even if not the only possible useful one |15U] . a particularly 
popular action in covariant approaches to quantising gravity |122] . like the spin foam 
|133] and group field theory approach [121] . is the formulation as a constrained BF 
(or Plebanski |138[ llUlj ) theory. Here one starts from topological BF theory [HI] in 
four spacetime dimensions, and adds suitable constraints on the two-form B variables 
of the theory such that, on solutions of these constraints, the action reduces to the 
Palatini or Hoist action for general relativity. We will summarise the idea behind this 
formulation in the following. In the original Plebanski formulation the constraints 
on the B variables are quadratic, and so are the discrete constraints that are then 
implemented in the spin foam models based on a simplicial discretisation. On the 
other hand, the most recent developments in the spin foam and group field theory 
approach to quantum gravity are based on a linear set of discrete constraints, which 
can be shown to be slightly stronger, in the restrictions they impose on the original 
BF configurations, than some of the original discrete quadratic constraints. Once 
more, we will detail this construction in the following. 

Here we investigate whether a formulation in terms of linear constraints is also 
possible in the classical continuum theory, and what it implies. We will see that the 
replacement of the so-called diagonal and cross-simplicity constraints with linear con- 
straints at the continuum level is relatively straightforward, after one has introduced 
new variables forming a basis of three-forms at each point. One then needs addi- 
tional constraints corresponding to the volume constraints. We will see that one can 
linearise these constraints too. We then give a discrete version of these linear volume 
constraints (thinking of spacetime which is discretised in a triangulation), which bears 
a striking resemblance to the so-called "edge simplicity" constraints of [47j. We note 
that only certain linear combinations of the volume constraints one would naively 
write down are necessary to constrain the bivectors T,^^{A) sufficiently. Similarly to 
the quadratic case, we will also see that when linear diagonal and off-diagonal con- 
straints hold everywhere in a 4-simplex, and one also imposes "closure" constraints 
on both bivectors E"*(A) (associated to triangles) and normals na{A) (associated to 

relativity; for instance, an interesting variant of the (complex) Plebanski formulation has recently 
been explored by Eyo Eyo Ita III (Ph.D. thesis, University of Cambridge (2010)). 
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tetrahedra), the sufficient set of linear combinations of the hnear volume constraints 
follows. This additional four- dimensional closure constraint on the normal vectors 
has, to the best of our knowledge, not been considered or implemented as an ad- 
ditional condition in the spin foam literature yet, although it does appear in some 
ffist order formulation of Regge calculus |35j, and it plays also a role in the discrete 
analysis of |^|D]. 



4.3 Constraining BF Theory 

Let us briefly review what is known at the classical continuum and discrete level, 
concerning the Plebanski formulation of classical gravity. We limit our considerations 
to the covariant, Lagrangian context, and to a very small subset of the available 
results, those which have been already of direct relevance for quantum gravity model 
building, especially in the spin foam context. For recent results in the canonical 
Hamiltonian setting, both continuum and discrete, see |28 | W7 \ 0.60] . 

In this chapter we consider the Einstein- Hilbert- Palatini- Hoist [821 [83] Lagrangian 
(without cosmological constant), a modification of the action (13. Lip used earlier, 

^EHPH = ^ / (I eabcd e^Ae'A R'^'lu] + 1 A e" A Rab[co]] , (4.3.1) 

where spacetime is assumed to be of the form 7^ x M so that a (3+1) splitting can 
be performed, cu"^ is a G-connection one- form (the gauge group G is 5*0(3, 1) or 
5*0(4), or an appropriate cover), R"''' its curvature, and is an R'^- valued one- form 
representing an orthonormal frame. The term involving 7, known as the Hoist term, 
is not relevant classically; it is, up to a total derivativqll, proportional to T° A T^, 
and hence does not modify the classical equation of motion = 0. When (14.3. ip is 
coupled to matter, the Hoist term leads to a rescaling of the couphng of the matter 
spin density to torsion. 

Though not relevant classically, the Hoist term is of fundamental importance in 
loop quantum gravity (LQG), where 7 is known as the Barbero-Immirzi parame- 
ter, and more generally for any canonical formulation of gravity, as it modifies the 
symplectic structure of the theory. 

If one introduces a g-valued two-form, 

= ^ (l^^'cd Ae^^ ie'^ A e^) , (4.3.2) 



^For a discussion of the total derivative term see 1117 . 
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then the action fl4.3.1|l becomeJi] 

S = Jb'^'a R,,[u] + A°C,[5] , (4.3.3) 

i. e. it takes the form of a topological BF theory with additional constraints which 
enforce that B""^ is indeed of the form f l4.3.2p . and that are enforced by means of 
Lagrange multipliers Xa- 

As said, BF theory without constraints is topological. Its equations of motion 
imply that Lo^^ is flat and the covariant exterior derivative of vanishes. Having no 
local degrees of freedom, the quantisation of such a theory is therefore rather simple 
and quite well understood. Inspired by this classical formulation, the main issue when 
trying to construct a quantum theory related to quantum gravity, in four dimensions, 
is then the correct implementation of appropriate constraints that lead to f l4.3.2p for 
some set of one- forms e", either at the level of quantum states or in a path integral 
formulation. Indeed, the bulk of the work in the spin foam approach [l9l [571 1105[ 1133] 
(as well as in the group field theory formalism [161 11211 1123] ). in recent years, has 
been devoted to this task. These constraints are also the subject of this chapter. 

To simplify the following calculations, we introduce another two-form field T,"'^, 

where s is the spacetime signature, s = — 1 for G = S0{3, 1) and s = +1 for 
G = SO (A) (and we assume 7^ 7^ sj^. This is a linear redefinition which simplifies 
the constraint f l4.3.2|) . 

E^'' = e" A e\ (4.3.5) 
87r7G 

but leads to more terms in the action. The translation of all calculations from one 
set of variables to the other is usually straightforward. 

The traditional way to enforce the restriction fl4.3.5p . the one matching the origi- 
nal classical Plebanski formulation of gravity, was to add quadratic simplicity con- 
straints^ to the action [iSlll43j . 

CafecdS^^S^^ = Ve^„pa , (4.3.6) 



■'■We use indices from the beginning of the Greek alphabet such as a to denote abstract indices 
not associated -with certain transformation groups. 

§For uniformity of the discussion, we shall in the foUo'wing talk about "time" and use the label 
even ■when the gauge group is SO (A) and the signature Riemannian. 

^ "Simplicity" because a t^wo-form that can be ■written as a ■wedge product of one-forms is called 

simple. 
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where V can be expressed in terms of S"^ by contracting (14.3.61) with e^^f"' ^ to give: 
V = ^e'^'^P" eabcd^l^l^pt ■ This is itself a reformulation of the original Plebanski con- 
straint, which would read: 

e'^'^'^Ef^Ef^ = Ve"^"'^ , (4.3.7) 

and is equivalent to the first under assumption that V ^ Q everywhere. The version 
(14.3.61) has the advantage of permitting a much simpler discretisation and thus a 
more straightforward implementation within the spin foam formalism. Under the 
same assumption V 7^ 0, there are the following four classes of solutions to (14.3.61) : 

either S'^'' = ±e" A or T."^ = ±]^t''\dE'' ^ E'^ (4.3.8) 

for some set of one-forms E°- or (in the following, we reserve for one-forms 
satisfying the first relation, viewed as encoding the metric; since we consider pure 
gravity, the factor 8'7r7G' can obviously be introduced by rescaling). One would like 
to select only the first class of solutions S*^^ = +6°" A e**, which, when substituted in 
the BF action, gives the Hoist action (14.3.11) . Classically, this is not a severe problem. 
As shown in |143j . non-degenerate initial data of a solution of the form S"'' = +e" Ae^ 
generically remain within the same branch of solutions. The situation in the quantum 
theory, where one necessarily has contributions from all branches, is less clear. 

More troublesome, if = 0, the field S"'' does not permit a straightforward 
geometric interpretation at all. Since in the region of the phase space where = 0, 
the theory is less constrained, and hence has more degrees of freedom, these non- 
geometric configurations should be expected to be dominating in a path integral 
|143] . unless measure factors are such that this is avoided. 

Spin foam models are usually defined in a piecewise fiat context, and spin foam 
amplitudes are defined for given simplicial complexes [133 ] . Therefore one is interested 
in identifying a discrete version of the above constraints that could be imposed at 
the level of each complex. The version (I4.3.7P of the simplicity constraints admits 
only a rather involved discrete counterpart |15] and, upon quantisation, leads to the 
Reisenberger model [HI 1142] , which has so far received only limited attention. 

The discrete analogue of the constraints (14.3.61) led instead [HI 1143] to the con- 
struction of the Barrett-Crane model [17], in the case in which the Barbero-Immirzi 
parameter is excluded from the original action (7 — )■ 00). The construction is ini- 
tially limited to a single 4-simplex, the convex hull of five points in (M^''^, in the 
Lorentzian case) with the topology of a 4-ball, whose boundary is triangulated by the 



General Relativity From a Constrained Topological Theory 



88 



five tetrahedra identified by the five independent subsets of four sucli points, wliile 
subsets of tfiree points identify the four triangles belonging to each of these five tetra- 
hedra, each of the triangles being shared by a pair of tetrahedra (see figure . One 

P5W / \ ^Ps 



Pi P2 

Figure 4.2: A 4-simplex is generated by five points. 

then associates a Lie algebra element (bivector) E^f G so (4) ~ A^M^ (similarly in 
the Lorentzian case) to each triangle A in a given triangulation by integrating the 
two-form S"^ over A. The task is then to constrain appropriately these Lie algebra 
variables (or their quantum counterpart) following the continuum treatment. 

It is useful to split the set of continuum equations 04.3.61) into two sets. Out of 
the 21 equations fl4.3.6p . one first identifies and imposes those 18 which have zero on 
the right-hand side (the "diagonal" and "off-diagonal" simplicity constraints), 

eabcd^f.^'^i = ea6cdS;:tS^p = V/i, z/, p (no sum over /i, u) . (4.3.9) 

This corresponds to the case if one or two of the indices of the two fields E coincide. 
At the discrete level, this translates into two triangles on which the same fields are 
discretised which either coincide or at least share a single edge, and thus belong to 
the same tetrahedron. Thus all bivectors S^^ are required to satisfy 

eabcd'^A^A = (diagonal simplicity constraint) 

and 

^abcd^A^A' = for all A, A' sharing an edge (cross-simplicity constraint). 

These two sets of equations can be imposed at the level of each tetrahedron in the 
4-simplex. 

The remaining three equations (the "volume" constraints) are equivalent to the 
requirement that: 

^abcd^0l^23 ~ ^abcd'^02^13 ~ ^abcd'^03^12 ^ ^(^) ' (4.3.10) 
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and can be imposed at the discrete level as the requirement that, for each 4-simplex: 
^abcd^A^A' — ^ for all A, A' not sharing an edge (volume constraints) 



where V is defined by the above equation, and is interpreted, on the solutions of the 
constraints, as the volume of the 4-simplex. 

An additional condition on the bivectors is usually considered, namely the "clo- 
sure" constraint, which states that the sum of four bivectors corresponding to the 
faces of one tetrahedron is zero: 



This constraint can be understood in two ways. One can either view it as the condition 
that the triangles described by the variables S^f close to form a tetrahedron [H], or 
as a consequence of the equations of motion. In a topologically trivial region such 
as the interior of a tetrahedron, a flat connection can be set to zero by a gauge 
transformation. Then using Stokes' theorem, the integral over the equation rfE"* = 
can be written as /^S"^* = 0, which is the closure constraint. The canonical 
counterpart of this condition is the so-called Gauss constraint, which generates local 
gauge (rotation) transformations and is to be imposed on the quantum states of the 
theory. 

The same picture appears in three spacetime dimensions, where there are no sim- 
plicity constraints and one directly deals with a 5u(2) connection one-form and an 
5u(2)- valued (using 5u(2) ^ M^) one- form e". Here the equation de"" = is integrated 
over a (spacetime) triangle to give a closure constraint. The vectors associated to 
the edges of the triangle add up to zero, and thus have a consistent geometric in- 
terpretation as edge vectors in M^. In this sense, an n-form with vanishing exterior 
derivative and appropriate internal indices can be given a geometric interpretation 
as describing n-simplices closing up to form an (n + l)-simplex. We shall encounter 
another instance of this statement later on. 

The closure constraint, being linear in the S's and local in each tetrahedron, is 
obviously easier to impose at the discrete level, and in the quantum theory, than the 
volume constraints. Thus it is a useful fact that it can indeed be imposed instead 
of them. More precisely, it can be shown |1U5] that the volume constraints in each 
4-simplex are implied if one has enforced the diagonal and cross-diagonal simplicity 
constraints, plus the closure conditions everywhere, i.e. in all the tetrahedra of the 



(4.3.11) 




(4.3.12) 



ACA 
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4-simplex (in general, i.e. for non- degenerate 4-simplices, involving tetrahedra be- 
longing to different "time slices"). From a canonical perspective, this observation is 
usually phrased as an interpretation of the volume constraints as "secondary con- 
straints" required to guarantee conservation of the other constraints (including the 
Gauss (closure) constraint) under time evolution. 

After a period of investigations, several potentially worrying issues have been put 
forward regarding the Barrett-Crane model PQ [12] (for a more recent analysis of 
the geometry of the Barrett-Crane model, see [IS|), and have given impetus to the 
development of alternative spin foam models |l9l|57]. These models are known to have 
nice semiclassical properties [18], and, importantly, generalise the spin foam setting 
to include the Barbero-Immirzi parameter at the quantum level (for an early attempt, 
see |107] ). and thanks to this allow for a more direct contact with the canonical loop 
quantum gravity. Their study is still somewhat preliminary, but the above properties 
make them promising candidates for a quantum theory related to gravity. One of 
the central features of the new models is the replacement of the quadratic simplicity 
constraints f l4.3.9p by linear constraints of the form 

na(A)S"''(A) = VA C A, (4.3.13) 

where Ua is the normal associated to the tetrahedron A and A is any of the faces of 
A. 

It can be shown that these are lightly stronger than the discrete diagonal and off- 
diagonal quadratic simplicity constraints, and remove some of the discrete ambiguity 
in the solution for S"*: out of the classes of solutions (I4.3.8p . one can restrict to (the 
discrete version of) S'^'' = ±e" Ae'' only. For a geometric analysis of these conditions in 
the discrete setting, see |10l HH |57] , and for a proof that the same discrete conditions 
can also lead to the Barrett-Crane model, see |16j . 

4.4 Linear Constraints for BF-Plebanski Theory 

The purpose of this chapter is to investigate whether a formulation in terms of linear 
constraints is also possible in the classical continuum theory, and what it implies. 

Let us work backwards, at first. Assume that the two-form field S"* is of the form 
j^ab _ j^a ^ j^h^ iYisX the "frame field" (not necessarily associated with a metric) 
E°- is non-degenerate, i.e. that the matrix (-E^) is invertible. It follows that 

EP,^f, = 5^^El~5tEl. (4.4.1) 
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In order to make a connection to the discrete setting it is more convenient to work 
with exterior powers of the cotangent bundle only (n-forms can be integrated over n- 
dimensional sub manifolds) . Hence we multiply f l4.4.ip by ep^rv and insert the relation 
jE^^ = (detEj^) eadefE^E!^E[, which is true for invertible matrices, obtaining 



eadefEiElEl^^ = (det E^) {e,^,,El - e^^^.E^) . (4.4.2) 

One can define the three- form Uaarv = '"■a[o-TD] = ^adefE^E!^E[, so that fl4.4.2p take 
the form 

Ua^rv^f, = (det E^) {e^^rvEt - e.^rvEl) . (4.4.3) 

HacTTv can be interpreted as a 3D volume form for the submanifold parametrised by 
(x*^, x'^, x^) embedded in 4D spacetime, whose internal index gives the normal to this 
submanifold. If E'^ are a basis of one- forms at each spacetime point, then Tifi are a 
basis of three- forms at each spacetime point, and so one can choose to work either 
with one or the other set of variables. Clearly E"- can be reconstructed from na. 



leP---na.rv = s(det E^)Ef = s^det Qe— n^^.^^ E^ . (AAA) 

This means that the set of variables na{x) define a co-tetrad frame at any point of 
the spacetime manifold (for the discrete analogue of the above, see [40'j). 

4.4.1 Linearised Diagonal and OfF-Diagonal Constraints 

So far we have just rewritten the equation we want to obtain for S'*^. Let us now 
consider the implications of imposing f l4.4.3p as constraints, where we restrict to those 
with zero right-hand side, i.e. those for which {/i, u} C {a, r, v}. These are half of the 
equations fl4.4.3p . This will identify the continuum analogue of the linear simplicity 
constraints. 

Claim 4.4.1 For a basis Ua of three-forms, the general solution to 

na.rv^f, = y{fi,u}c{a,T,v} (4.4.5) 

is 

K = G,uE^;e'J , (4.4.6) 
where E'^ is defined in terms ofriaarv (is in lj^4.4-4\ l> ^^'^ so in particular non-degenerate, 



and Gpy = G^^, and = 0. (Obviously, as the variables E"^ and E"- , the "coeffi- 
cients" Gfj_i, are spacetime dependent.) 
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Proof. First note that we can rewrite fl4.4.5|l as 

ea,,fEiElElT.f, = (4.4.7) 

with defined by fl4.4.4p . Then E'^ by assumption defines a basis in the cotangent 
space, so that 

K = GliE^El (4.4.8) 
for some coefficients Gf^ with Gil = G^^^y Substituting this into fliXTj) . we get 

= eadefE^X^^ElG^iE'^ = e^.™ det(E;)4G;^« , (4.4.9) 

and since det{E'^) ^ and E^ form a basis of (the internal) M'^, this imphes that 

^x<rrvGli = y{fi,iy}c{a,r,v}. (4.4.10) 

It follows that G^l = unless {x,^} = {l^, ^} aiid so G^l = sl^S^Gny. 

□ 

By a linear redefinition = A^i?^ one might try to set some of the G^,^ to a given 
value (usually ±1, but one might prefer ±g^^), but it is clear that one needs two 
conditions on the G^u for this to be possible. 

In the discrete context, one sets na{^) = (1, 0, 0, 0) for each A by a gauge trans- 
formationjii One could use some of the gauge freedom here to restrict the form 
of Ha'- This amounts to finding a convenient parametrisation for the coset space 
GL(4)/S'0(3, 1). Let us make the (usual) assumption that the normal to hypersur- 
faces {t = constant} is indeed timelike. Then one can use the boost part of S0{3, 1) 
to set nai23 = (C, 0,0,0). The remaining S0{3) subgroup can then be used to make 
the (3 X 3) matrix njoo-r, where i G {1, 2, 3}, upper diagonal, so that one has the form 



(4.4.11) 



Clearly, when this form of riaarv is assumed, integrating the three-form Ua over a 
region where t is constant gives a vector in that only has a time component. Its 
magnitude specifies the three-dimensional volume of such a region. 
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"This seems to involve an implicit assumption, namely that there is a non-degenerate normal to 
each tetrahedron, as well. 
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4.4.2 Linearised Volume Constraints 

As in the quadratic case, further constraints, in addition to the hnear simphcity 
constraints f l4.4.5p . are needed to complete the identification S"* = ±e" A e^. 
First of all, one can show the following. 

Claim 4.4.2 Under the assumption that all G^y are non-zero, the necessary and 
sufficient conditions for the existence of a linear redefinition = )^i_iE'^, such that 
either S"^ = ce° A or T,"-'' = — ce" A e^, where c is a given positive number, are 

GuGo3 = G01G23 = GisGoal^ 0) . (4.4.12) 

Proof. Set = 1] the extension to arbitrary c amounts to a further rescaling by 
y/c. Then the required redefinition is possible if and only if there exist Aq, . . . , A3, 
such that either G^^, = X^Xy for all /i 7^ z/ or G^y = —X^Xy for all \x ^ v. Clearly 
(14.4. 12p are necessary. They are also sufficient: Take 



Ai 



G 



23 



A2 = sgn ( — — , A3 = sgn — — , (4.4.13) 



which solves the equations for 6*12,^13 and (^23 with sgn ^ '^g^" j specifying the 
overall sign. The remaining three equations for Goi)G^02 and Cos are then solved by 
the two relations (14.4. 12p and 

^ ^^"^ V G23 ) ^ ■ (4.4.14) 

□ 

The assumption G^y 7^ is necessary: One solution to (14.4. 12p is G12 = G23 = 
Gi3 = with the other G^y non-zero, which cannot be expressed as G^y = iA^Aj^. 

Further constraints, in addition to the linear simplicity constraints (I4.4.5p . are 
needed to complete the identification S"'' = ±e" A e^. One possibility is to use the 
quadratic volume constraints (14.4.120 . Take the three volume constraints (14.3. lOp . 

ca6cd^01^23 — ^abcd^Q2^Vi — ^abcd^Q'i^l2 i [^.^.10) 

and substitute the solution = G^yE^^Et^ of This gives precisely (14.4. 12p . 

The non-degeneracy assumption needed for (14.4. 12p is then the usual one, namely 
^ in dlSSD. 
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This shows that imposing the hnear version of the diagonal and off-diagonal sim- 
plicity constraints f l4.4.5p together with the quadratic volume constraints (14.3. lOp and 
a non- degeneracy assumption on S"^ implies that 



ice'' A (4.4.16) 



for some set of one-forms e", where c > can be chosen at will. Thus, linearising the 
diagonal and off-diagonal simplicity constraints means that two of the four types of 
solutions for S'^* are removed, but on the other hand one needs to introduce a basis 
of three-forms at each spacetime point, which is put in as an additional variable. 
One also still has to assume V ^ 0. 

There is also generically no evolution of initial data with V ^ into a degenerate 
E"* with V = and a non-geometric interpretation (this is part of the discussion 
of |143j ). The geometry of the spacetime manifold is specified by e"' and not by E"- 
which is only used to determine normals in the constraints. 

Alternatively, one might prefer to use a linear version of the volume constraints 
as well. Consider the original equation f l4.4.3p 



n. 



aarv 



E J = (det {e,^rvEt - e^^^^El) , (4.4.17) 



which was equivalent to E"'' = E"" /\ for a basis of one-forms E"". So far we only 
considered one half of these equations, namely those with {/x, z/} C {u, r, t;}. The 
other half have the form 



n. 



E J = (det El)t^,,^El , no sum over v , (4.4.18) 



avTV ^ 



with e^yrv 7^ 0. One way to read these equations is as the requirement on the left-hand 
side to be totally antisymmetric in {fj,,T,v): 



UauTV^lt = Uauvf^^rl = Uauf^r^yl , nO SUm OVCr U . (4.4.19) 

We could again try to turn the argument around and impose (14.4.191) as constraints 
on a 0- valued two- form E'*'' together with the linear simplicity constraints (14.4.51) . 
Substituting the solution E^^ = Gf^iyEjj'Eu of the linear simplicity constraints into 
(14.4. 19p gives (after diving by a non-zero factor |det(£'^)) 

(^^uTvG i^ipE^^ tfi^^^Gfi/E^ t^uufGyi^E^ . (4.4.20) 
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For e^urv 7^ this would imply G^u = Gru = Gy^- By Claim |4A2| imposing (14.4. 19p 
for one fixed z/, say z/ = 0, is generically not sufficient: If we know that Gqi = G02 = 
Go3 7^ 0, we still have the condition 

G'i2 = Gi3 = G23, (4.4.21) 

so that one needs more conditions of the form (14.4.191) . These will then imply that 
all G^u are equal, G^^ = c' for some c' that could be positive, negative, or zero. One 
can absorb |c'| by an overall redefinition, so that one has 

Ea6 = ±ce''Ae^ (4.4.22) 

for any chosen c, as before. Note that here it is possible, if c' = at a point, that 
all are zero this point and so S'*^ = as well. While this is a very degenerate 
geometry, it is still a geometry. 

While the conditions (14.4.191) . imposed for all values of z/, are therefore sufficient 
to complete the identification of the two-form field S"'' as ±ce" Ae^, note that (14.4. 19p 
is a massively redundant set of constraints: In order to obtain at most five relations 
on the coefiicents G^j, (two relations (14.4.121) if all G^i, are nonzero), we are imposing 
eight vector equations! We have not exploited the fact that (I4.4.20p is a multiple of 
one of the vectors -E^, which are by assumption linearly independent. We could add 
several of the equations (14.4. 19p for different u, instead of considering all equations 
for different u separately. Let us try to impose 

J2 Yl riaurv^f, = 0, rG {0,1, 2, 3} fixed. (4.4.23) 

Again substituting the solution SJ^^ = G^yE^^E^ of the linear simplicity constraints 
into (14.4.231) . we obtain 

^det(E^)J] ^^^.^' = 0, r fixed, (4.4.24) 

which implies, by linear independence of the E^, that indeed Gfj_„ = G^u for all 
r ^ {/i, z/, t;}. It is then sufficient to impose the constraint (14.4.230 for three different 
choices of r, say r = 0,1,2, so that we only need three vector equations instead of 
eight. 

By absorbing the constant G^y = d (all Gp.^ are equal) we rescale all E°- by the 
same factor to obtain the variables e" that will have the physical intepretation of frame 
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fields encoding the metric geometry of spacetime. Wliile in tfie case of quadratic vol- 
ume constraints the one-forms E"", or alternatively the three- forms n°', only specified 
the normals to sub manifolds {x^ = constant}, for linear volume constraints they can 
be directly interpreted, up to a position-dependent normalisation, as specifying an 
orthonormal basis in the cotangent space. 

Note that this implies that one can assume a convenient normalisation for the 
one-forms E"". Instead of just assuming non- degeneracy det{E'^) ^ 0, one could fix 
det(£'^) = 1. This is no restriction of the physical content of the theory as the E"", for 
both linear and quadratic volume constraints, only have a geometric interpretation 
after rescaling. One could then interpret E'^ as a map into SL{4, M). For linear volume 
constraints, the relation between the normalised one-forms E"" and the variables 
that are interpreted as frame fields is a single function on spacetime which may be 
viewed as a "gauge" in the sense of Weyl [163] . 

In contrast to the case of the quadratic volume constraint, no non-degeneracy 
assumption on the two-form S"'' is needed to enforce simplicity. One might get 
H"-^ = in some region as a solution to the constraints, in which case the action for 
this region will be zero. This is analogous to a metric with vanishing determinant 
in general relativity and, in contrast to the requirement V ^ outlined above, not 
an additional issue. Notice, however, that one still has to assume that the tetrad 
field E°- and, equivalently, the co-tetrad field Ua are non-degenerate, in order for the 
simplicity and volume constraints to imply (14.4.161) . Failing this, one gets solutions 
of the constraints that admit no proper geometric interpretation. 

In the end, writing the action for BF theory in terms of S"'', 

S = j B^''AR,, = J S"^ A Rab + ^e'^'cd^"' A Rab , (4.4.25) 
we substitute (14.4. 16p into this action, which gives (setting c = g;;^) 

S=^ [ crix) d-eabcde" A e" A R^'' + -e" A A R^!] . (4.4.26) 

One is left with a field a{x) that can take the values ±1, but in the classical theory 
one may again argue that if a = 1 everywhere on an initial hypersurface, there will 
be no evolution into cr = — 1. What we obtain is first order general relativity where 
one uses (det e) = ±| det e| instead of | det e| as a volume element in the action. If a 
is continuous as classical fields usually are assumed to be, this differs from the action 
with I dete| by an overall sign at most. 
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To summarise, we have identified both a hnear version of the quadratic simplicity 
constraints and a hnear version of the (quadratic) volume constraints in the con- 
tinuum, which can be used to reduce topological BF theory to 4D gravity in the 
continuum. We have found also that both linear versions are slightly stronger [i.e. 
more restrictive) than the corresponding quadratic constraints, so that the resulting 
constrained theory is likely to be closer to gravity at the quantum level than the 
one in which quadratic constraints are implemented. We now discuss the discrete 
counterpart of the constraints found above. 

4.4.3 Discrete Linear Constraints and their Relations 

The discrete analogue of fl4.4.5p is just the linear constraint used in llSlETj, as desired: 

na(A)S''^(A) = VA c A. (4.4.27) 

One could write down also a discrete version of fl4.4.19p . obtained in the natural 
way, demanding that within the same 4-simplex 

n„(A)S''^(A') = na{A')j:^\A") (4.4.28) 

whenever A' ^ A and A" ^ A' and the edge shared by A' and A is the same as that 
shared by A' and A". 

In the following we adopt the notation of |105] . where the tetrahedra in a given 
4-simplex are labelled by A, B, C, D, E, so that triangles are represented by AB, AC, 
etc., and edges by combinations ABC, ABD, etc. The orientation of the triangles 
and tetrahedra in (I4.4.28P is then fixed by the signs of the permutations of the letters, 

n,(AA)S'^'(ABc) = -^a(AB)S'^'(AAc) = ^a(Ac)S»'(AAB), etc. (4.4.29) 

In analogy to the continuum case, it will be sufficient to impose, instead of the 
full set of conditions f l4.4.29p . certain linear combinations of f l4.4.29p to complete the 
geometric interpretation of the bivectors S'^''(A). The discrete analogue of the three 
continuum equations f l4.4.23p . where the index e was kept fixed, is to pick one of 
the tetrahedra and add those six equations out of (14.4.290 which involve triangles 
belonging to this tetrahedron. Starting with A, we impose the constraint 



n,(Ai)S"'(AAj) = 0, 



(4.4.30) 
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and the equivalent conditions for the tetrahedra B to E, thereby needing to satisfy 
only five instead of 20 volume constraints. 

The above discrete formulation of the linearised volume constraints resembles 
strongly the edge simplicity constraints studied, in a canonical setting, in |17] , and it 
imposes indeed the same restriction on the discrete data. However, it does not match 
exactly any of the various expressions given for these edge simplicity constraints in 
|47j . The correspondence between the two, therefore, deserves to be studied in more 
detail, given also that edge simplicity constraints have been shown to be crucial for 
the kinematical phase space of BF theory (and of loop gravity) to reduce to that of 
discrete gravity, in accordance with what we find here in a covariant setting. 

In spin foam models such as 021 EZ], as we mentioned earlier, only the diagonal 
and off-diagonal simplicity constraints, but no quadratic volume constraints ( 14.3. lOp 
are imposed. This is because in the discrete setting, one can use the closure con- 
straint f l4.3.12p . imposed in all the tetrahedra in a 4-simplex, to relate the (quadratic) 
simplicity constraints to the volume constraints, so that if the former are imposed 
everywhere the latter follows. Since the quadratic simplicity constraints follow from 
the linear ones, as can be easily checked, this argument is still valid if one uses linear 
simplicity constraints. 

One might hope that the sufficient set of linear volume constraints (14.4.300 would 
also follow from the linear simplicity constraints and the closure constraints. This 
is almost the case, but not quite. In fact, one more constraint should be added to 
simplicity and closure imposed in the five tetrahedra in the 4-simplex. This is a "4D 
closure" constraint of the form 

HaiAj,) + UaiAs) + UaiAc) + ^^(Ad) + ^^(Ae) = , (4.4.31) 

where Aj are the (appropriately oriented) tetrahedra of a given 4-simplex. 

Just as for the usual closure constraint (14.3. 12p . there are two ways to understand 
why such a constraint must be imposed. Recall that if one demands the triangles 
described by discrete variables close to form a tetrahedron, they have to satisfy 
(14.3. 12p . Alternatively, one can start with the continuum field equation V[^''S°^j = 0, 
where V*-'^-' is the covariant derivative for the connection a;"'', set the (fiat) connection 
to zero by a gauge transformation, and integrate this over an infinitesimal 3-ball 
(whose triangulation is a tetrahedron). 

The new constraint (I4.4.3ip seems to be the analogous statement that tetrahedra 
close up to form a 4-simplex. By Hodge duality A^R^ ^ A^M^ and any internal cov- 
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ector na(A) can be mapped to a three- form; unlike for two- forms, any three-form can 
be written as A E'^ A E^ for some e°. Demanding that the tetrahedra described 
by these three-forms form a closed surface is then fl4.4.31l) . Thus the simplicial geo- 
metric reasoning goes through also for this new constraint. In terms of the equations 
of motion of the theory, on the other hand, the only argument for the need of this 
constraint is the following. If Vj|^''S"^j = and we assume that S"** = ±e" A e^, then 
it follows that Vj^^^'n^^^] = as well. Integrating this equation (with the connection 
again set to zero) over a 4-ball (which can be thought of as our 4-simplex) whose 
boundary is a 3-sphere, triangulated by tetrahedra, then leads to f l4.4.3ip . We then 
however have to assume simplicity of S"^. A more direct derivation of fl4.4.3ip from 
the equations of motion would be desirable. 



D 




A B 



Figure 4.3: Dual picture of a 4-simplex; points represent tetrahedra, lines represent trian- 
gles shared by adjacent tetrahedra. 

The role of this constraint, anyway, is the following. Consider the closure con- 
straint 

S'^''(Aab) + S"''(Aac) + S'^^(Aad) + S"''(Aae) = 0. (4.4.32) 

Contracting with ^^(Ab) gives, using the linear simplicity constraint na(AB)S"^(AAB) = 
0, 

n,(AB)S'^*(AAc) + na(AB)S°^(AAD) + riaiA^)!:-" {Aj^^) = 0. (4.4.33) 

Alternatively, one may start with the 4d closure constraint and contract with S'^-^(Aab) 
to get, again using the linear simplicity constraints, 

n,(Ac)S'^''(AAB) + ^a(AD)S'^''(AAB) + na(AE)E'^''( Aab) = 0. (4.4.34) 

In total one obtains 20 -|- 10 = 30 equations of this kind that can be used to express 
some of the combinations na(A)S"''(A) in terms of others. Substituting the result- 
ing expressions into the five discrete volume constraints (14.4. 30p one finds that the 
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equations fl4.4.30p indeed follow from the relations f l4.4.33p and fl4.4.34p . We have 
seen in the continuum that the summed constraints (14.4.230 are sufficient to identify 
T,^^ = ±e^Ae^, and hence we find that in the discrete case the situation is analogous 
to the case of quadratic constraints in that a sufficient set of volume constraints can 
be viewed as secondary. 

To see more clearly what happens in both our construction and in the case of 
quadratic constraints analyzed in [105], note that in our linear case one could use the 
3d and 4d closure constraints to express the variables ^^(Ae) and ^'^^(Aae)? ^"^(Abe), 
E"''(Ace), S"^(Ade) in terms of the others. Taking the linear simplicity constraints 
into account, one is then left with twelve independent combinations na(A)S'^*(A), 
just as in the continuum. In the continuum, we saw that one can impose the three 
additional constraints f l4.4.23p on the twelve contractions ria^^^S^^ to complete the 
identification S*^^ = ±e" A e^. In the discrete case, one has the following three addi- 
tional conditions coming from linear cross-simplicity constraints: 

= n„(AE)S'^^(AAE) 

= n,(AB)S'^^(AAc) + n„(AB)S"^(AAD) + n,(Ac)S"''(AAB) 

+n,(Ac)S"'(AAD) + ^a(AD)S"^(AAB) + ^a(AD)S'^^(AAc) (4.4.35) 

and similar ones coming from na(AE)S"^(ABE) = and ri(j(AE)S"^(AcE) = 0. These 
are precisely the analogue of the continuum constraints fl4.4.23p . 

Similarly, in the case of quadratic simplicity constraints, one can use 3d closure 
to eliminate S'^''(Aae), S'^^(Abe), S'^^(Ace), S''*(Ade)- Then one observes that ad- 
ditional quadratic cross-simplicity constraints give expressions such as 

= e,,,,S'^^(AAE)S^'^(ABE) 

= e,fe,rfS'^^(AAc)S^'^(ABD) + eafeedS'''(AAD)S^'^(ABc) (4.4.36) 

which are equivalent to the desired (two) volume constraints. 

All of this is an exercise in solving a system of linear equations for which there 
might be a more simple and elegant description, but the upshot is the following. 
The sufficient set of linear volume constraints f l4.4.23p does indeed follow from the 
linear simplicity constraints and the closure constraints, once one also imposes a 
four- dimensional closure constraint on the normals to tetrahedra that seems very 
natural in light of their geometric interpretation. Just as in the formulation in terms 
of quadratic simplicity constraints |105] . the volume constraints can be viewed as 
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secondary constraints that imply conservation of the simphcity constraints in time, 
or put differently, the volume constraints follow if the simplicity constraints hold 
everywhere. Once more this strenghtens the relationship between the discrete linear 
volume constraints we have identified and the edge simplicity constraints of |17] . 



■^ab 



4.5 Lagrangian and Hamiltonian Formulation 

Let us briefly outline the Lagrangian formulation of 4D gravity resulting from our 
linear constraints added to BF theory. One adds the linear simplicity and volume 
constraints to the action of BF theory using Lagrange multipliers: 

(4.5.1) 

where the Lagrange multiplier field s^*^*^™ satisfies s^'^'^^*^ = ^l^"^^^*^™', and 

e^rv'^b'^'^'^^ = (no sum over z/) . (4.5.2) 

Indeed, varying with respect to Sf, then gives back the constraints 

0, |u, u} C |cr, r, v}, 

n^^^^Ef, = \ ' ' ' ' ' ^' (4.5.3) 

((^tMTvfu, z/ = CT (for some /^) . 

Note that the second line corresponds to the set of constraints (14.4. 19p and not to 
the summed version fl4.4.23p . and that it is clearly sufficient for the geometric inter- 
pretation of S'*^. The field equation from varying with respect to the connection u is 
the usual 

V[;)SS]=0, (4.5.4) 

where V is the covariant derivative for the connection a;"''. The remaining equations 
involve the Lagrange multipliers, as would be expected: 

\e>^'''''R,b,A^] + ^e'^^^-eabcRfAco] + ^r^'^a^^^r. = , ^r^^'^fu = . (4.5.5) 

We have seen that the constraints imply that S'^^ = ±e"' A e^, and when substituting 
this back into the action one will recover general relativity, modulo the possible sign 
ambiguity we have already discussed. 

We leave a complete Hamiltonian analysis of this theory to future work. However, 
we note a feature of the theory that follows directly from the use of linear constraints, 
and from the introduction of the additional variables Ua- 
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As in unconstrained BF theory the initial dynamical variables will be the spatial 
part of the connection ojf' and its conjugate momentum P^^ = ^^e^^^T^abij- We also 
saw that the equation of motion V|^^S^^j = is unaffected by the constraints. Hence 
there will be Gauss constraints of the form 

gcd ^ Q,pcdi ^ ^c^.pedi ^ cj'i^.p^ei (4.5.6) 

on the canonical momenta. Their role is to generate G gauge transformations. 

Looking at the action fl4.5.ip . one would already require that fl4.5.6p should be 
modified to generate gauge transformations on the normals Uaarv] (14.5.11) is only 
invariant under gauge transformations if the three-forms Uaarv are transformed. The 
need for such a modification is also seen if one computes Poisson brackets between 
the linear simplicity and Gauss constraints. Define "smeared" constraints 

C[E] := I Ef'^^^na.rvUjkP^" , g[A] ■= j ^'^cd- (4.5.7) 



One then finds that 

{C[S],^[A]} 



5C[E] 5g[K\ 



^ B '^aarv^ijk 



p^adpb^^ _ ^bdpak 



-C[A • S] - y Ht'^™A'^'^n,<,.„e,,fcP^' , (4.5.8) 



where (A ■ S)^-'''^^'^ = AJ^'E]^''^'^" . The first term alone would imply that ^[A] generates 
gauge transformations, but the second term is an unwanted extra piece. For Q to be 
a generator of gauge transformations, it must be first class {i.e. commute with other 
constraints up to linear combinations of constraints). We can remedy this by adding 
the variables n^^j^p to the phase space, together with their conjugate momenta tt"'^'^^. 
Now we can define a new Gauss constraint 

g'cd ^ gcd _ n^^p-K'^'^t'^P . (4.5.9) 

Then, computing the Poisson brackets of the new Gauss constraint with C[H], one 
finds 

{C[E],g'[K\} = {C[E], gm- I sr™A™ne...e,,fcP«'"= = -C[A ■ E] , (4.5.10) 

as desired. We have however increased the number of phase space variables at each 
point by 32. 
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A similar reformulation of the Gauss constraint, leading to a relaxation of the 
gauge invariance properties of spin network states, has been already suggested by the 
Hamiltonian analysis of the Plebanski theory [2E], and it has been advocated in the 
loop quantum gravity context in [21 1106] as well as spin foam and group field theory 
context piTHlfm]. 

4.6 Summary and Outlook 

We have investigated a formulation of classical BF-Plebanski theory where the con- 
straint S"^ = ±e"' A e^, needed to reproduce general relativity in four dimensions, 
starting from topological BP theory, is imposed through constraints linear in the 
bivector field S"^. The discrete counterpart of a part of these linear constraints (the 
"simplicity constraints"), in fact, has proven very useful in the spin foam approach 
to quantum gravity [HI [57] . 

The corresponding continuum constraints have been easily identified, and can 
indeed be used to replace the quadratic "diagonal" and "off-diagonal" parts of the 
simplicity constraints appearing in the Plebanski formulation. As in the discrete case, 
one needed to introduce a new set of variables Ua which are assumed to form a basis 
of three-forms at each point of spacetime, and are slightly stronger than the quadratic 
constraints: they eliminate two of the four sectors of solutions that are present for 
quadratic constraints. 

In the second part of the analysis we found that the quadratic volume constraints 
of the Plebanski formulation, needed to complete the identification S'*^ = ie'^Ae*, can 
also be replaced by linear constraints, which again are stronger than their quadratic 
analogues. They do not require an additional non- degeneracy assumption on S"''. 
However, a non- degeneracy assumption on the three-forms Ua is still necessary, and 
only when this is imposed one can hope to eliminate all "non-geometric" degenerate 
configurations for E"^, which are feared to dominate the quantum theory in the case 
of quadratic volume constraints. Also, while for quadratic volume constraints the 
variables merely specify normals to sub manifolds {x^ = constant} and hence can 
be independently rescaled arbitrarily at each point, for linear volume constraints 
they directly specify, up to an overall rescaling, the frame field encoding the metric 
geometry, i.e. an orthonormal basis in the cotangent space at each spacetime point. 

We have then analysed the discrete (simplicial) translation of the linear constraints 
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we identified. In the context of spin foams, tfie quadratic volume constraints follow 
from imposing the (quadratic) diagonal and off-diagonal simplicity constraints every- 
where together with closure constraints on the discrete variables S"^(A). We have 
shown a similar property for the linear volume constraints. If (linear) diagonal and 
off-diagonal simplicity constraints and closure constraints for both bivector variables 
E"''(A) and normals na(A) are imposed everywhere, a sufficient set of linear volume 
constraints follows. This means that "non-geometric" bivector configurations can- 
not appear if the additional closure constraint on the normals holds, and the same 
normals are assumed to be non-degenerate. 

We have not performed a complete Hamiltonian analysis of the resulting linear 
constrained BF action for gravity, but only noted that the use of linear simplicity and 
volume constraints immediately requires a modification of the usual Gauss constraint 
to generate a transformation of the normal 3-form variables Ua alongside that of 
the S's; a similar relaxation of the Gauss constraint, which translates at the spin 
foam and discrete gravity level into a closure constraint for simplices, and in the 
canonical quantum gravity context into a generalisation of spin network states, has 
been suggested on more than one occasion in the literature [2], |3l [161 1106] . even if 
its proper implementation at the quantum level has not been yet developed. On 
the classical level, therefore, a full Hamiltonian analysis of the constraints would be 
highly desirable. This would involve adding momenta for the components Sq^, which 
are Lagrange multipliers in unconstrained BF theory, as well as those for the normals 
Ha we have introduced, so that all variables can transform nontrivially under G gauge 
transformations generated by a modified Gauss constraint, as shown. 

Still at the classical level, but with obvious implications for the quantisation, 
one aspect of our construction that deserves further work is the relation between the 
discretised linear volume constraints we have found and the edge simplicity constraints 
used in ^]. As noted, the two sets of constraints appear to be very similar, and their 
role in the classical theory is the same, in particular, they remove (partly) the non- 
geometric configurations from the configuration space (or phase space) of the theory 
and appear as "secondary" in the sense specified above. So it natural to conjecture 
that one is simply a reformulation of the other. The implications for the quantum 
theory are not only due to the dominant role that non-geometric configurations may 
play in the quantum theory, if not removed, but also in the fact that one discrete 
formulation of these constraints can actually be simpler to implement in a spin foam 
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context than the other. 

The possible use of our findings in the spin foam and group field theory con- 
text, and more generally in any quantisation based on the formulation of gravity 
as a constrained BF theory, are in fact most interesting. In particular, it seems to 
be important to explore how a closure constraint on normals could be implemented 
into existing spin foam models, given that we found it to be necessary for the full 
imposition of the geometric constraints on the variables of topological BF. A conve- 
nient setting to do so could be the GFT formulation of [16j, since there the simplicial 
geometry and the contact with classical actions is brought to the forefront. 
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Chapter 5 

SU{3) and its Quotients 

5.1 Introduction 

The Lie group that will play a central role in this second part of the thesis is the special 
unitary group SU{3). In the standard model of particle physics, this group plays two 
important roles: Firstly as the (colour) gauge group of quantum chromodynamics 
(QCD), the version of Yang-Mills theory describing the strong interactions, secondly 
as the group of flavour symmetry which relates different generations of quarks, fa- 
mously introduced in Gell-Mann's "Eightfold Way" [60j. We shall in the following 
focus on the electroweak sector of the standard model [72 | I146[ 1161] . where the gauge 
group is SU{2) x f/(l)/Z2 ^ U{2) |132j . and possible extensions of it, where one 
adds another SU{2) group acting on "right-handed" fields. We are interested in CP 
violating processes which can change quark flavour, which are again described by an 
element of SU{3), the Cabibbo-Kobayashi-Maskawa (CKM) matrix. Ambiguities 
in the definition of the bases with respect to which this matrix is defined mean that 
all matrices obtained from a given element of SU{3) by left or right multiplication 
by an element of the maximal Abelian subgroup ?7(1)^ should be regarded as equiva- 
lent to the original matrix. Hence the space of CKM matrices is the double quotient 
U{l)^\SU{3)/U{iy . In chapter [6] we will make statements about statistics of the 
Jarlskog invariant J, a measure of the magnitude of CP violation, which assume a 
choice of measure on this quotient space. In this chapter we study the geometry of 
SU{3) and its quotients to motivate different possible choices of measure. 

We will also discuss measures on the space of Hermitian and complex 3x3 matrices 
whose construction relies on the knowledge of measures on quotients of SU{3), and 
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which become important if one considers the space of quark mass matrices. 

Let us first introduce coordinates on SU{3) that we find convenient. The Lie 
algebra su(3) is generated by (i times) the Gell-Mann matrices 
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To make the passage from SU{3) to the single or double quotients straightforward, 
we note that any element of SU (3) can be written as 
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(5.1.1) 



where 



and 



U^AlVAr, 



y — Q^'-^^l g-i'wAs gii/As giiwAs gi2A2 



(5.1.2) 
(5.1.3) 



(5.1.4) 

where the ranges of the Euler angles x,y,z and the complex phases w,p,q,r,t are 
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< X ,y , z < — , < w ,p ,q ,r ,t < 27r . 



Explicitly, the matrix V takes the form 
/ 

V = 



cos y cos z 



cos y sm z 



(5.1.5) 



e ™ sin y \ 



— cosa; sm z — e smxsmy cos 2; cos a; cos z — e sm a; sm y sm 2; smxcosy 

y sin a; sin z — e"" cos x sin y cos ^ — sin x cos ;2 — e^^ cos a; sin y sin 2; cos a; cos y J 

(5.1.6) 



whereas the diagonal matrices Al and Ar are 

= diag(e''^ e'(«-^\ e-'^+«)) , = diag(e'(''+*\ e'^'-*), e"'^") . 



(5.1.7) 



It should be clear the orbits of the left action of the subgroup f/(l)^ are parametrised 
by (p, q) , while the orbits of the right action are parametrised by (r, t) . Therefore one 
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may use {x,y,z,w) to parametrise the double quotient f/(l)^\S'f/(3)/f/(l)^. Indeed, 
fl5.1.6p is commonly used in the literature as a parametrisation of the CKM matrix. 

We may now obtain left-invariant forms on 5*^7(3) by computing the Maurer- 
Cartan form 

U'^dU = i\acra. (5.1.8) 

The resulting expressions are somewhat involved, and we use the usual complex nota- 
tion to slightly shorten the expressions (taking real and imaginary parts one recovers 
the (Tj which are all real), 

c"i + 1(72 = ie^'* uj — e^'* sin y cos(22;) (sin w dx + cos w sin(2x) dq) + -e^^* sin(22;) Q , 

(73 = - cos(22;) Q + dt + siny sin(2z) (sin wdx + cos w sin(2x) dq) , 

(74 + ids = ie'(=^"+*+'") cos zdy- cos y sin z{ie'^^''+''> dx + e'^^''+'^ sin(2x) dq) 
_j_gi(3r+t+«)) j,Qg y gjj^ y j,Qg 2; ( 3 (^p " c^w cos(2x) dq) , 

(Te + iar = ie^(3'-*+«') sin zdy + cos y cos ;z(ie^(^"-*) da; + e^^^'^"*) sin(2x) dq) 

_^gi(3r-i+«)) ^^^y ^^^y sin^(3c^p - dw + cos(2x) dg) , (5.1.9) 

as = \—dp + Adr + dw + 2dq cos(2x) cos^ y + {3dp — dw) cos(2y)] , 

where we defined the 1-forms 

u = dz + dx cos w sin y — 2dq cos x sin w sin x sin y , 

n = 3dp + dw + dqcos{2x){cos{2y)-3) + {3dp-dw)cos{2y). (5.1.10) 

A general left-invariant metric on the group will have the form 

ds^ = g'^'aaab (5.1.11) 

for some constant symmetric matrix g"'^. Depending on the choice of g"''', the resulting 
metric will have a certain subgroup of SU{3) as isometrics acting by right multipli- 
cation on SU{3). The possible cases are detailed in [H]. The most symmetric case is 
the bi-invariant metric, invariant under SU{3)i^ x SU{3)ji, which results from taking 
gab proportional to 6°"^. 

Since we will be mainly interested in the measure, we note that for any choice of 
g"''' the (left-invariant) volume form is, up to an (irrelevant) overall constant, 

l^uJ^^ °^ CTi A (T2 A CTg A (14 A Cr5 A CTg A (T7 A (Tg (5.1.12) 

3a/3 

= — — sin(2x) sin y cos^ y sin(22;) dx A dy A dz A dw A dp A dq A dr A dt , 
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which is independent of the complex phases w,p,q,r and t. Clearly left invariance 
fixes the volume element up to an overall constant: Pick a group element g (say the 
identity), then the space of 8- forms at g is one-dimensional. The volume form at any 
other group element is then determined to be the push-forward of the volume form 
at g under left translation. The volume form (15.1.121) will be the central quantity in 
many of the following calculations. 

One could of course have started demanding right-invariance, computing the right- 
invariant forms from 

dUU-^=i\aTa. (5.1.13) 

If the left- and right-invariant measures are identical the group is called unimodular. 
Compact and semisimple groups are unimodular. Hence we can use left and right 
invariance as interchangeable conditions on a natural measure on SU{?)). 

If G is compact and semisimple, (the negative of) the Killing form is a bi-invariant 
Riemannian metric which is also an Einstein metric [9] . It may therefore be regarded 
as the most natural choice of metric on such a group. For a (complex) matrix Lie 
group this metric is defined by 

ds'^ = i:i{dgdg^) . (5.1.14) 

It can also be viewed as the induced metric on the group G, interpreted as a sub- 
manifold of C"^" with flat metric. (The same considerations are of course true for a 
group of real matrices.) 

We will use this metric as the most natural choice later on. It does give the same 
measure as any left-invariant metric, as we have noted. For completeness, we give its 
form explicitly for SU{'i): 

ds^ = ^Tr(df/df/t) =5"V^a6 

3 

= 3 + dq'^ + 3 rfr^ + dt"^ + - (3 cos(2?/) -l)dpdr 
+?> cos^ y (cos(22;) dp dt + cos(2x) dq dr) 

+ - [cos(2a;) cos(2z) (cos(2?/) — 3) + 4 sin(2x) sin(22;) sin?/ cosw] dqdt 
— sin^ y {3dp — 3dr + cos(2x) dq — cos(2z) dt)dw 
+2 sin y sin w (sin(2z) dt dx — sin(2x) dq dz) 

+dx^ + dy^ + dz^ + siv? y dw'^ + 2 siny cos wdxdz . (5.1.15) 
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5.2 The Homogeneous Space SU{3)/U{lf 

The coset space S'f/(3)/f/(l)^ is a reductive geometry as explained in section ITTT 



The sphtting of su(3) into an Abehan subalgebra u(l)^ and the orthogonal subspace 
is invariant under the adjoint action of SU{3). We may hence identify an appropriate 
subset of generators of SU (3) and use the corresponding one-forms to define metrics 
on the homogeneous space SU{3)/U{iy . 

The one-forms (J3 and ag correspond to the Gell-Mann matrices generating the 
Abehan subgroup U{iy. We can construct left-invariant metrics on SU{3)/U{1)'^ 
from the remaining six left-invariant forms Ui, cr2, CT4, . . . ,a-j. The left group action of 
SU (3) on the homogeneous space again guarantees the existence of a unique (up to 
a constant) left-invariant volume form, and we find that this unique volume form is 

^^u{3)/u{if ^ Aa2 Aa4A(T5 A(T6 Aar (5.2.1) 
3^/3 

= ^ sin(2x) sin y cos^ y sin(22;) dx A dy A dz A dw A dp A dq , 

where we note that, due to the SU{3) decomposition (15.1. 2p . we may parametrise 
the quotient SU{3)/U{iy by the SU{3) coordinates {x,y, z,w,p,q). This is of course 
the same volume form as the one obtained by starting with the left-invariant volume 
form on SU (3) and integrating over the coordinates r and t, or by just considering a 
slice {r = constant, t = constant} in SU{3) with the measure (I5.1.12p . (We shall see 
in the next section that this need not be true if one reduces to the double quotient 

u{iy\sui3)/u{iy.) 

There is an additional possible source of non-uniqueness of the measure, namely 
a possible ambiguity of the definition of the same manifold as different homogeneous 
spaces X = G/H = G' / H' . In practice, this does not seem to lead to different results 
for the measure. 

It should be noted that the homogeneous space considered here, where the sub- 
group is the maximal Abelian subgroup, the "maximal torus", of a compact group, is 
what is known as a flag manifold. Such a manifold is naturally a Kahler manifold. For 
any G-invariant complex structure on a flag manifold, there is a unique G-invariant 
Kahler-Einstein metric |i9J. This metric may then be regarded as the most natural 
choice on such a homogeneous space. An explicit construction for such metrics on 
spaces SU{M) /U{1)^^~^ where U{1)^~^ is the maximal torus of SU{M), is given in 
|136] . The case M = 6 might be of interest for neutrino mixing; here the analogue of 
the CKM matrix is the Maki-Nakagawa-Sakata matrix [114], usually taken to be 
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an element of the single quotient f/(l)^\S'f/(3). In the see-saw mechanism one adds 
very heavy right-handed neutrinos, and the most general mixing matrix would be an 
element of U{1)^\SU{Q). The construction of |136] would be the starting point of an 
analysis of this quotient, which is beyond the scope of this thesis. 

5.3 Metrics on U{lf\SU{3)/U{lf 

The double quotient f/(l)^\5'f/(3)/f/(l)^ does not have a natural SU{3) group action, 
because the left and right actions of SU (3) on itself do not commute with the Abelian 
subgroup f/(l)^ and hence do not induce a group action on the double quotient: 
If g and g' are in the same equivalence class, [g] = [g'], then g = hg'h' for some 
h,h' G U{1)^. For a group action to be defined on the double quotient, one would 
need [j ■ g] = [j ■ g'], j ■ g = k{j ■ g')k' = k{j ■ h~^gh'~^)k' for some k, k' G U{iy. This 
can only be done for those j that commute with h~^. 

It is therefore less straightforward than in the previous cases to find a geometrically 
motivated measure. The simplest way of reducing the left-invariant measure on the 
group SU{3) or the homogeneous space SU{3)/U{iy to the double quotient would 
be to just integrate f l5.2.ip over the coordinates p and q, which is trivial since the 
volume form fl5.2.ip is independent of p and q. One would therefore essentially use 
the same measure for the double quotient. 

Alternatively, one might start from a metric on either the group or the homoge- 
neous space, reduce it in some way to the double quotient, and then determine the 
associated volume form, but this introduces two possible sources of ambiguity; the 
choice of metric and the choice of method of reduction. We will see that indeed the 
measure on the double quotient is vastly non-unique. 

For example, one may be tempted to identify the double quotient with the sub- 
manifold p = g = r = t = Oof SU{3), take a symmetric metric on SU{3), say the 
bi- invariant metric flS.l.lSp . and set p = q = r = t = 0. This approach was indeed 
followed by Ozsvath and Sch ticking |126] , and leads to an appealingly simple form of 
the metric 

ds'^ = dx^ + dy^ + dz^ + 2sva.y cos w dxdz + sin^ y dw'^, (5.3. 1) 

but seems not to be justified geometrically: It corresponds to a gauge choice, a choice 
of section in the orbits of U{1)^ x f/(l)^ over the double quotient, and introduces 
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symmetries (three commuting Killing vectors) into the metric which should not be 
present. It seems much better motivated geometrically to do a Kaluza-Klein reduction 
of a metric, i.e. to project it orthogonally to the orbits of t^(l)^. One writes a given 
metric on SU{3) (say) as 

ds^ = hij{x) {dy' + A\{x)dx^){dy^ + A\{x)dx'') + g^,y{x)dx^ dx" , (5.3.2) 

and identifies g^u as a metric on the quotient space parametrised by (a;^). 

To understand what is going wrong in f l5.3.ip . we compare with the simpler case 
of SU{2). The bi- invariant metric on SU{2) is 

rfs^ = {dip + cos e d(j)f + de^ + sin^ 9 , (5.3.3) 

where ^ generates the left action of U{1) and ^ generates the right action of U{1). 
Projecting the metric orthogonally to the orbits of right translations, d la Kaluza- 
Klein, gives the round metric 

ds"^ = de^ + sinH d(f)^ (5.3.4) 

on SU{2)/U{1) = S^. By contrast, simply setting dip = (the analogue of the 
construction of Ozsvath and Schiicking) instead gives the flat metric 

ds^ = de^ + . (5.3.5) 

The round metric f l5.3.4p is invariant under 5*0(3). The flat metric (15.3.51) appears 
to be invariant under the Euclidean group, with ^ and ^ having the appearance of 
translations, but these are only local symmetries since is a periodic coordinate and 
6 lies in an interval. 

The example of SU{2) also illustrates the difference between taking the left- 
invariant measure on the homogeneous space and a measure obtained by Kaluza-Klein 
reduction of a metric to a biquotient. The biquotient U {1)\SU (2) / U (1) = U{1)\S'^ is 
just an interval. Its metric becomes, after performing another Kaluza-Klein reduction 

of dssa]), 

ds^ = de^ . (5.3.6) 

The measure would be dO, and not d9s\.n9 as obtained by integrating a function 
f{9) over the coordinate cp. It is apparent from this simple example that there are 
inequivalent ways of calculating integrals of a function on a right quotient that is 
invariant under the left group action; namely, one can either reduce the metric to 



SU{3) AND ITS Quotients 



114 



obtain a measure on the double quotient or take the measure on the single quotient 
and integrate out the left phases. 

Performing a Kaluza-Klein reduction of the bi-invariant metric (15.1.151) we find 
a rather complicated metric on the double quotient, whose determinant gives the 
measure 

det g oc sin 2x sin 2z sin y cos^ y ■ ^(sin^ 2x + sin^ 2z) sin^ y 

1 1 
+ -(5 cos2y — 3) sin^ 2x sin^ 2z + - sin4x sinAz sin^ y cosw 

1 ~ ~ 

+ -(3 cos2?/ — 5) sin^ 2a; sin^ 22; sin^ 1/ cos^ w ) . (5.3.7) 
8 / 

One might think that, starting from a more general ("squashed") left-invariant 
metric on the homogeneous space SU{3)/U{1)'^, there might be some which after 
Kaluza-Klein reduction give rise to a simpler expression for the measure on the biquo- 
tient, but we have only found more complicated ones. 



5.4 Hermitian and Complex Matrices 

In chapter |H] we will require measures on the space of either all Hermitian or all 
complex 3x3 matrices which will appear as the space of quark mass matrices in the 
standard model of particle physics or certain extensions of it. These will involve the 
left- (or right-) invariant measure on the homogeneous space SU{3)/U{iy , as we will 
see shortly. Let us start with the space of 3 x 3 Hermitian matrices. It has a natural 
group action by U{3) by conjugation, corresponding to a change of basis. We would 
like to determine a metric invariant under this action. 

Any Hermitian matrix can be diagonalised, i.e. written in terms of a real diagonal 
and a unitary matrix: 

Mn = U^DU, (5.4.1) 

It should be clear that U is only defined up to left multiplication by elements of 
f/(l)^; U should be regarded as an element of the homogeneous space U{1)^\SU{3) ~ 

su{3)/u{iy. 

The most symmetric measure on the space of Hermitian matrices is induced by 
the bi-invariant metric 

ds'^ = TiidMudMn) = Tr (dD rfD) + 2Tr [{dU of - {dU U^f D^^ (5.4.2) 
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which is clearly invariant under the action of U{3) by conjugation. Define right- 
invariant one-forms Ta on SU (3) by 

dUU^ =iXaTa, (5.4.3) 

this becomes*] [with D = diag(Z}i, D2, -D3)] 

ds'^ = Tr {dD ■ dD) - 2rarf,Tr {\a[D, \b]D) 

= dDl + dDl + dDl + 2{{D,- D^firl + r^) + {D, - D^firl + r^) 

+ {D,-D,nTl + T^)] . (5.4.4) 

The corresponding volume form is 

{Di - D2f{Di - D^f{D2 - D^f dDl A dD2 A dD^ An At2 An An Atq Atj . (5.4.5) 

We have explained above that any left- or right-invariant measure on the coset 
U{iy\SU{3) must be given by (15.2.11) and hence there is no need to compute the 
Ta explicitly. We obtain a Riemannian measure 

DMu := (Di- D2f{Di - D:if{D2 - D^f sin(2x) cos^ y sin y sin(2z) dDi dD2 dD^ 
X dx dy dz dw dr dt (5.4.6) 

on the space of Hermitian 3x3 matrices. Note that we use coordinates (x, y, z, w, r, t) 
on U{lY\SU{?>) according to the decomposition ( I5.1.2p . The measure ( I5.4.6P favours 
strongly non- coinciding eigenvalues. 

From (15.4. ip . it is apparent that each Hermitian matrix with three distinct eigen- 
values is associated with six different elements of x U{lf\SU{3), related by the 
action of the discrete group 63: 

Mh = U^DU = {U^P-^)PDP-\PU) =: fj^ DU , P G 63 , (5.4.7) 

where 63 is the symmetric group of degree 3 (the dihedral group of order 6, sometimes 
denoted by D3 or D^) which permutes the canonical basis vectors of M?. The set of 
matrices with coinciding eigenvalues has zero measure and hence can be ignored in 
the present discussion. 

Thus we need to consider the space 

X {U{lf X e^)\SU{?,) insteacill, restricting 
the coordinates on the quotient U {1)'^\SU {3) to an appropriate range to pick one of 



* Compare with the corresponding resuh for real matrices given in |71) 

■fNote that the space of Hermitian matrices is not topologicahy identified with this space; we 
have discarded a subset of Hermitian matrices which has zero measure for our construction. 
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the six matrices related by the 63 action. We can use the fact that the 63 action 
permutes the rows of an SU{3) matrix to demand that the elements of the third 
column (cf. f l5.1.6p : note that absolute values will not depend on rephasing, i.e. right 
multiplication by an element of t/(l)^) satisfy the relation 

|sin?/| < I sin x cos 2/1 < |cosxcos?/|, (5.4.8) 

which restricts the coordinates x and y to 

< y < arctan(sinx) , < x < — . (5.4.9) 

Using the right-invariant measure (15.2. ip on U{1)'^\SU{3), we see that this region 
has precisely one-sixth of the total volume: 

7r/2 7r/4 arctan(sinx) 

j dz j dx J dy sin 2x cos^ y sin y sin 2z 

° ° = -• (5.4.10) 

tt/2 7r/2 tt/2 6 

J dz J dx J dy sin 2x cos^ y sin y sin 2z 



Notice that while the total volume of (f/(l)^ x &3)\SU{3) will be finite, the in- 
tegrals over the M.^ part will diverge; the eigenvalues of arbitrary Hermitian matrices 
can of course be arbitrarily large. One would have to introduce some kind of cutoff 
to make integrals over the space of Hermitian matrices convergent, as we will do later 
on. 

The space of 3 x 3 complex matrices can be discussed along similar lines. Here, 
one has a left and right action by independent elements of U{3). We should therefore 
determine a metric invariant under both of these group actions. An arbitrary complex 
matrix can be represented as 

M, = uIdUr, (5.4.11) 

where Ul, Ur are unitary and D = diag(Di, D2, D^) is real diagonal. Ul and Uji are 
only defined up to simultaneous left multiplication by a diagonal unitary matrix 

Ul^AUl, Ur^AUr, AeU{lf (5.4.12) 

which so that one has the correct number of real parameters, only 18 instead of 21. 
Since here the signs of the elements Di can be changed by a different choice of Ul 
and Ur, and one can permute the as for Hermitian matrices, there are additional 
discrete ambiguities given by elements of the group 63 x Zg. We will see shortly 
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that the measure, similar to the measure for Hermitian matrices, vanishes whenever 
elements of D coincide up to sign, so we can restrict to matrices with D\ ^ D\ ^ 
Dl ^ Dl. Hence we can identify this subspace of the space of 3 x 3 complex matrices 
with X {U{lf X ©3 X Zl)\{U{i) X f/(3)) ~ ^ {u{lf x ©3)\(;7(3) x ^7(3)). 
The metric we will use to determine a measure is 



ds^ = Tr((ZMc dMl) , 



(5.4.13) 



which clearly is invariant under M — > OMO' for O, O' e U{3). To evaluate this, use 



dMr 



-Ul dUL uIdUr + Ul dD Ur + uId dUn , 



dM] = -UldUnUlDUL + UldDUL + UlDdUL 

and [D, dD] = to obtain 

Tr(dMc dMj) = Tr(dD dD) - Tr{D^{dUL Ulf) - Tr{D^{dUR Uj^f) 
+2 Tt{D dUL ul D dUR ul) . 

We introduce right-invariant one-forms 

dULUl^iXaTl, dURUl^iX,T', 



(5.4.14) 



(5.4.15) 



R 

where Xi, . . . , Xg are the Gell-Mann matrices, and 

/l 0^ 
A9 = y^ 10 
[o l) 

so that iXa are a basis for the Lie algebra u(3). Then 



(5.4.16) 



(5.4.17) 



Tr(dMerfMt) = Tr(ciDdD)+Tr(D^ A(„A,))(r^^+r]^r]i)-Tr(D A„D A,)(T2r^+r^r^) . 

(5.4.18) 
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The only nonvanishing traces in (15.4.18^ are 



Tr(D'AiAi) = Tr(D'A2A2) = Trp^AgAg) = TiiDX^OXs) = Df + D'^ 
Tr(Z}2A(3A8)) = Tr(DA(3/^A8)) = -^p? - DI) , 

Tr(Z}2A(3A9)) = Tr(DA(3Z^A9)) = ^J^iDl - Dj) , 

/2 

Ti{D^\s\9)) = Tr(DA(8Z^A9)) = ^{Dl + Dj - 2Dl) , 
Ti{D^\^\^) = Ti{D^\^\^) = Dl + Dl , 
Tr(D2A6A6) = Ti{D^\jM) = DI + DI , 

TiiD^XsXs) = Tt{DXsDXs) = ^{Dl + Dl + Wl) , 

2 
3 

Tr(DAiDAi) = Ti{DX2DX2) = 2D1D2 , 



Ti{D^X,X,) = TiiDX.DX,) = -{Dl + Dl + ^3^) , 



Ti{D^X^X^) = Ti{D^X^X^) = , 

Tr(D2A6A6) = liiD'^XjXj) = 2D2D3 . (5.4.19) 
The metric can be written in the form 

ds' = dDl + dDl + dDl + l{D,-D2nTl + rl,r + l{D, + D2nrl-r'^r 

+ 1 [Pi - D2nrl + r|)^ + (D, + D2)^(r| - r^)^ 
+ - Dsfirt + rt,f + {D, + D3)^(rl - r'^f] 
+ \ [{D, - D,)\tI + rlf + (Dl + D,)\tI - rlf 
+ {D2 - D,f{rl + r|)^ + {D2 + D^firl - r|)^] 
+ \{D2 - D.nrl + rj,r + ^(1^2 + D.nrl - rlf 




(5.4.20) 



The fact that the metric only depends on r| — r|, etc., and not on r| + etc., again 
reflects the U{1)^ that has to be factored out. One finds that the volume form is 
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proportional to 

{Dl-Dlf{Dl-Diy{Dl-Dl)^\DiD2D,\ rfDiArfDsArfDaAriAr^A. . .A(r«-r|)A(rl-r^) . 

(5.4.21) 

Note that this expression only depends on the absolute values of Di, as expected. 
Since the range of the Di is infinite, integration over these coordinates will give an 
infinity, as was the case for Hermitian matrices. Note the main difference to the 
measure fl5.4.6p in the different powers of Di. 

Now assume that all quantities we are interested in are independent of the pa- 
rameters on Ur, which will be the case in the discussion of chapter El Then we 
can integrate over these coordinates, obtaining a constant which is irrelevant in the 
averaging process. 

We are then left with integrating over the space of possible matrices Ul, the coset 
f/(l)^\f/(3) = U{iy\SU{3), and the volume form is proportional to 

{Dl-Dl)\Dl-Diy{Dl-Diy\DiD2Ds\dDiAdD2AdDsArlATlATtATlATlATl. 

(5.4.22) 

In terms of the coordinates on U{1)^\SU{3) introduced in fl5.1.2p . the wedge product 
of right-invariant forms gives the usual measure on SU{3), so that we finally get 

DMc = Y\_{D^—D'^Y\DiD2D3\ sm2x cos"^ y siny sm2z dDi dD2 dD^ dx dy dz dw dr dt. 

(5.4.23) 

The discrete 63 symmetry can be taken into account just as in the previous case: 
We integrate only over one sixth of the homogeneous space U{1)^\SU{3), correspond- 
ing to 

< y < arctan(sinx) , 0<x<^. (5.4.24) 

This restriction amounts to removing unitary matrices that permute the elements of 
D and hence to fixing an ordering. 
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Chapter 6 

Naturalness of CP Violation 

"Well, I don't like to get involved in these philosophical 
issues very mv.ch. " 

Murray Gell-Mann (attributed) 

6.1 Introduction and Motivation 

In this chapter we will apply the mathematical theory of measures on quotients of 
SU{3), outlined in the previous chapter, to the space of CKM matrices in the elec- 
troweak sector of the standard model in order to make statements about naturalness 
of the observed magnitude of CP violation in electroweak processes. We will see that 
the observed value is rather small when compared to the maximal possible magni- 
tude and aim to reach a conclusion if this should be viewed as a fine-tuning problem. 
The task of making mathematically precise statements is simplified by the fact that 
SU{3) is compact, and so normalisation of measures on SU{3) or a quotient of it will 
normally not be problematic. 

In modern theoretical physics one often tries to make statements about "natu- 
ralness" or "fine-tuning" of the observed values of fundamental parameters, where 
fine-tuning of a parameter is interpreted as an indication for incompleteness of the 
theory. Popular examples of fine-tuning problems include the quark mass hierarchy 
and the cosmological constant problem in particle physics. Since statements about 
naturalness are fundamentally of a statistical character, to make them mathemati- 
cally precise one has to assume a well-motivated probability distribution on the pa- 
rameter space relevant for the theory. A fine-tuning problem then indicates that the 
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probability distribution one has used should be modified by introducing new physical 
considerations. As an example, as long as observation was consistent with a vanishing 
cosmological constant, it seemed reasonable to assume that a postulated symmetry 
would constrain it to vanish. With more recent observations indicating that it must 
be taken to be very small and positive, there seems to be an issue of fine-tuningj. We 
note that much of the motivation to extend the standard model of particle physics 
is driven by such considerations, that it is by no means necessary to contemplate a 
"Multiverse" where all possible values of a given parameter are actually realised. We 
also need not consider anthropic arguments, which for the problem at hand would not 
give a satisfactory explanation of very weak CP violation; strong CP violation might 
be anthropically preferred since baryogenesis necessarily requires CP violation, and 
the CP violation in the electroweak sector does not seem sufficient jg7j . 

Of course we can only ever observe and make measurements in a single Universe, 
and if a parameter takes a value that appears unlikely maybe this just means that 
an unlikely possibility is realised in our Universe. One is merely doing statistics. But 
one should recall Bayes' theorem 



P(B\A)P{A) P{B\A)P(A) 

^'-^1^'= PiB) =E.,i'(B|A.)m.)' 

where one can take A as a hypothesis and B as an observation. In order to calculate 
the probability that a hypothesis follows from a given observation, one either needs a 
priori probabilities for both A and B and the conditional probability of B assuming A 
(which should be calculable), or a priori probabilities for a complete set of hypotheses 
{y4j}, together with all conditional probabilities of B given these hypotheses. These 
a priori probabilities are referred to as priors. In the second case the sum also has 
to be made explicit, and if it is an integral one needs to specify a measure. 

In cosmology, the role of priors is particularly important as there is no possibility 
to confirm observations in different experimental situations, and P{B) does not seem 
accessible. Here A would be a statement about our Universe. While it may be 
appropriate to consider P{A) as independent of A (Laplace's Principle of Indifference), 
a measure on the set of all universes has to be determined [61] . 

The situation considered in this chapter is similar, as "experiments" would involve 
different universes with different values of the fundamental physical parameters. For 
instance, one could try to use anthropic arguments, taking B to be the observation 



"For an alternative interesting but presumably non-mainstream viewpoint, see 
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that human beings have evolved in our Universe, to claim that the values of (in this 
case) CP violating CKM matrix parameters must take values close to the observed 
values (hypothesis A). The conditional probability P{B\A) could in principle be 
calculated using baryogenesis etc., but then one still needs a measure on the space 
of possible values that A can take. Again, one is forced to pick a measure on the 
parameter space. We shall therefore investigate predictions for the magnitude of 
CP violation, measured by the Jarlskog invariant J, comparing different choices for 
the measure on the parameter space that we regard as natural. The observation by 
Kobayashi and Maskawa [98j that CP violation is only possible for at least three 
quark families has led to a Nobel prize, but the issue of possible fine-tuning in the 
magnitude of CP violation is much less understood. 

It is true that, if one is talking about parameters in quantum field theory, they 
should really not be regarded as constants, but have an evolution with energy scale 
given by renormalisation group equations. However, since fine-tuning means a dis- 
crepancy of several orders of magnitude, it may well be that a fine-tuning problem 
is present at all energy scales. This is true for the quark mass hierarchy |17U] . and 
also for the case oi CP violation: Recent numerical studies [36] indicate that does 
not run strongly with energy scale, but that the value at extremely high energies 
(~ lO^^GeV) is merely about twice the value at low energies. It is then meaningful 
to talk about "naturalness" of the value of such a parameter. 

Since any statements one tries to make depend very directly on the choice of mea- 
sure, it is helpful if geometric considerations allow for a natural choice of probability 
distribution. We saw in the previous chapter that, if the parameter space is a homo- 
geneous space G/H for G a compact Lie group and H a closed subgroup, the natural 
requirement on the measure determining the probability distribution is invariance 
under the left action of G, which leads to a unique measure (up to normalisation). 
Probabilities for a given function on C/H to take certain values are then well-defined. 

We also saw that if, as in the case of the space of CKM matrices, one has a double 
quotient H\G/H, the problem of determining a natural probability distribution is 
more involved than for a homogeneous space G/H. We have discussed several possible 
choices that we compare here. We will find that while there is no clearly preferred 
choice of measure, there always seems to be fine-tuning in the observed value for J, 
unless additional input is used. 

In a second part we shall take a different approach, taking the observed values for 
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the quark masses into account by considering not the space of CKM matrices, but the 
space of mass matrices, as the fundamental parameter space. This is motivated by 
the observation that the mass matrices are directly hnked to the Yukawa couphngs 
and the Higgs vacuum expectation value, whereas the CKM matrix is only a derived 
quantity. The observed values for the quark masses will be taken as given, and a 
probability distribution be constructed that can reproduce these values. The choice 
made for this distribution will be as simple as possible in the following sense: The 
natural group action on the space of Hermitian mass matrices is the action of the 
unitary group f/(3) by conjugation, M — )■ UMW. There is essentially a unique 
measure invariant under this action. This measure is then modified by introducing 
the simplest possible function that would allow for a modification of the expectation 
values for quark masses fitting observation. No further assumptions will be needed. 
We will then find that this simple choice for the measure gives an expectation value 
for J that is remarkably close to the observed value. Hence we will conclude that 
once one assumes the quark masses as given, one does not face an additional fine- 
tuning problem with J. This statement, while not new, is hence made precise using 
a geometrically motivated measure on the parameter space. 

In the standard model one can restrict, without loss of generality, to Hermitian 
mass matrices, which is why we construct a measure on the space of 3 x 3 Hermitian 
matrices. However, in left-right symmetric extensions of the standard model, such as 
Pati-Salam, such an assumption can no longer be made and the mass matrices have 
to be regarded as arbitrary complex matriceJl]- This presumably has an effect on the 
statements one makes about naturalness of J; a similarly well-motivated measure on 
the space of all 3 x 3 complex matrices might lead to very different results. We will 
therefore in a further calculation redo the same analysis for general complex matrices. 
We will again use the most symmetric measure on the space of mass matrices, here 
the space of general complex matrices, and modify it in the simplest possible way to 
incorporate the observed quark mass hierarchy. While this is a choice that could of 
course be made very differently, it is a simple choice that uses as few assumptions 
as possible, and that has worked very well for the case of Hermitian mass matrices. 
What we will find is that the resulting probability distribution is different, and the 
result is different too: The observed value for J now appears to be unnaturally large, 
since CP violation should be more heavily suppressed by the quark mass hierarchy. 
^We thank Ben AUanach for pointing this out. 
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One faces a fine-tuning problem, and needs additional assumptions to modify the 
measure appropriately. 

We should point out that the only real input we use from the physical theory 
(standard model or a left-right symmetric extension of it) is, apart from the very def- 
inition of CP- violating parameters, how the theory restricts the type of mass matrices 
that appear. In particular, for any extension of the standard model (such as left-right 
symmetric models where parity is the left-right symmetrjj^]) that also has Hermitian 
mass matrices we would not see any modification in the results. Nevertheless, our 
calculations provide another example of how symmetries, in this case the presence or 
absence of a left-right symmetry, influence physical predictions of a given theory. 

The structure of the remaining part of the chapter is as follows: In section 16.21 
we review CP violation in the electroweak sector and detail why mass matrices may 
be assumed to be Hermitian in the standard model, but not if one has an extended 
left-right symmetry. In section 16. 3[ we compute expectation values for the Jarlskog 
invariant J using several of the measures on the space of CKM matrices that we have 
introduced. We also perform a more detailed analysis of how unlikely the observed 
value for J appears in these distributions. In section 16.41 we focus on the space of 
mass matrices instead, detailing how the observed values for the quark masses can be 
incorporated into a measure and how this completely changes predictions about likely 
values for J . We briefly mention how our analysis relates to the case of neutrinos and 
close with a summary and discussion of our findings. 

6.2 CP Violation in the Standard Model and Beyond 

We summarise how CP violation arises, first in the standard model and then in 
the more general case of left-right symmetric extensions, essentially following pi] . 
In the standard model, the quark fields appear as left-handed SU{2) doublets and 
right-handed SU{2) singlets: 



Here N is the number of quark families, which is arbitrary in the standard model, 
and normally taken to be three. The fields are written in a flavour basis which can be 
considered unphysical, since flavour eigenstates do not correspond to mass eigenstates. 
'''We thank an anonymous referee for Phys. Rev. D for clarification on this point. 
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The coupling of the Higgs doublet H to quarks is given by 

N 

^Cmggs = {^jtMUH^qkR + Y;,{q,q'),LHq',j, + h.c.) , (6.2.2) 
j,k=i 

where the Higgs doublet H and its C-conjugate H'-' can be written as 

H^±('^' + ''^A, H^^^('^'-''^A, (6.2.3) 
v2\^0o + i03/ V 2^-01+102/ 



and Yjk and Yjj, are complex (Yukawa) couplings. H transforms under SU{2) as 
a doublet and the term /^Higgs is invariant under SU{2). This symmetry is then 
spontaneously broken by the Higgs potential V{H) which gives rise to a vacuum 
expectation value v for The remaining components 4>j are 'eaten' by the W and 
Z bosons, which become massive, and the only remaining terms involve 0q = 0o — 
giving masses to the quark fields: 

^Higgs ^ - ^ {rrijkqJZqkR + m'jkQjLQkR + h-c) + . (6.2.4) 

j,k=i ^ ^ 

The new mass matrices m and ml are (up to a factor of -\/2) given by the original 
Yukawa couplings, multiplied by v. Thus, it seems appropriate to regard either the 
set of Yukawa couplings together with the Higgs vacuum expectation value or the 
collection of elements of m and m' as fundamental parameters of the theory. 

This part of the Lagrangian is formally CP invariant if and only if m and m', which 
are so far arbitrary complex matrices, are real. Since this condition is not satisfied in 
Nature, one has formal CP violation. However, as remarked before, the Lagrangian 
has been written in the unphysical fiavour basis. One can, for general m and m', pass 
to a different basis, namely the basis of mass eigenstates, by diagonalising the mass 
matrices with unitary matrices: 

m^UlMJn, m' = {U'LyA'U'R, (6.2.5) 

thus the basis of mass eigenstates is related to the previously considered basis by 

?Lphys = UlQl , qRphys = UrQr , CtC. (6.2.6) 



It is always possible to choose the unitary matrices so that A and A' are real, and 
in this new basis this part of the Lagrangian is invariant under C and P separately. 
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and hence also under CP. However, the electroweak Lagrangian also contains charged 
current terms mixing up- and down-type quarks, coupled to the W boson fields via 
(we are now using the basis of mass eigenstates) 

ld,\ 

Xc-={Wl-iWl)Jl^ + \i.c., Jr.= {u,c,t)LrV SL , (6.2.7) 

where V := UiiU'^Y is the Cabibbo-Kobayashi-Maskawa (CKM) matrix. In the 
basis of mass eigenstates, this term Xc is not invariant under CP unless V is real. 
Since we consider the mass eigenstates as physical, CP is violated through these 
charged current terms. 

An important observation made in [SS] is that one can redefine the right-handed 
quark bases by arbitrary unitary transformations, 

Un^OUn, U'^^O'U'j,, (6.2.8) 

obtaining a new basis which is to be regarded as equally physical. This is due to 
the absence of charged current terms involving right-handed quarks, since they are 
singlets under SU{2). It is therefore no loss of generality to set Ur = Ul and f/^ = 
in fl6.2.5p . and to assume that m and m' are Hermitian. 

A natural way to extend the standard model is to assume the existence of a 
second SU{2) symmetry which acts on the right-handed quarks, as in the Pati-Salam 
model [128] . In such extensions, one adds a term (16.2.71) for right-handed quarks to 
the Lagrangian. This has the important consequence that a general transformation 
(16.2.81) for arbitrary unitary transformations O and O' can no longer be regarded as 
giving an equivalent quark basis, since it modifies this new charged current term. The 
mass matrices cannot in general be taken to be Hermitian, but are arbitrary complex 
matrices. 

There are now also two possibly CP violating terms, and two CKM matrices. We 
focus on V = ULiU'i)"^ which involves the processes that are actually observed [12] 
and disregard Vr = UR{U'j^y in the following. 

Mathematically, V is an element of SU{N), but since the phases of the quark 
fields are arbitrary (even for non- Hermitian mass matrices), V is only defined up 
to left or right multiplication by a diagonal element of SU{N), i.e. an element of 
the maximal torus U{1)^~^. The space of CKM matrices is therefore the double 
quotient U{1)'^~^\SU{N)/U{1)'^~^ , characterised by {N — 1)^ parameters, out of 
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which \N[N — 1) may be taken to be real (Euler) angles and the remaining |(A^ — 
1)(A^ — 2) appear as complex phases. It follows that the matrix V can be taken to 
be real for N = 2, and so in the formalism explained here one needs at least three 
quark families to have a possibility of CP violation. Kobayashi and Maskawa were 
awarded the 2008 Nobel Prize for using this observation to predict the existence of a 
third quark family [98J. We set = 3 in what follows. 

The mathematical theory of observable measures of CP violation in the standard 
model was developed by Jarlskog [HI] • She showed [89J that all necessary and sufficient 
conditions for CP violation can be summarised as the following condition on the 
commutator of the Hermitian matrices m, m': 

detC := det(-i[m,m']) ^ 0. (6.2.9) 

One finds that explicitly 

detC = -2J{mt-mc){mc-mu){mu-mt){mb-ms){ms-md){md-mb) , (6.2.10) 

where J := 3m(ViiV22V\*2V2\) is the Jarlskog invariant which is invariant under 
left or right multiplication of by a diagonal matrix, i.e. an element of f/(l)^. 
The geometrical interpretation of the quantity J is given by the so-called unitarity 
triangles. These express the requirement on V to be unitary, so that for instance 

{vv^)u = Viiv;^ + Vi2v;2 + 1^131^23 = . (6.2.11) 

In the complex plane, the three complex numbers that sum to zero form the sides of 
a triangle. The absolute value \ J\ is twice the area of this triangle. Since there are 
different unitarity triangles corresponding to different elements of VV\ all with the 
same area, there are several ways of expressing J in terms of the elements of V . A 
general formula is given by jBH] 

J ^^P^^oki = MVajVpkV:kV;^) . (6.2.12) 

The quantities describing the CKM matrix which are invariant under rephasing of 
the quark fields are J and the absolute values \Vaj\. 

In the general case where m and m' are not assumed to be Hermitian, the corre- 
sponding quantity would be 

detC := det (-i [mm^mW"^]) (6.2.13) 

o T/ 2 2\f 2 2\f 2 2\f 2 2\f 2 2\f 2 2\ 
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which of course leads to the same conditions on the mass matrices as before (given 
that we took all masses to be positive before). In the literature, the use of C is 
perhaps more common than the use of C, and one may well argue that this second 
measure of CP violation should be considered more fundamental as its value does 
not depend on the arbitrary signs of the mass terms in the Lagrangian. 
Recall from chapter [5] the parametrisation of the CKM matrix fl5.1.6p 

^ cos y cos z cos y sin z e~™ sin y \ 

V = — cos X sin z — e"" sin x sin y cos z cos x cos z — sin x sin y sin z sin x cos y 

y sin X sin z — e"" cos x sin y cos z — sin x cos z — cos x sin y sin z cos x cos y J 

(6.2.14) 

where the ranges of the Euler angles x,y, z and the complex phase w are 

0<x,y,z<^, 0<w<2n. (6.2.15) 

In this parametrisation, the Jarlskog invariant J is given by 

J = -sin(2x) sin(2z) siny cos^y sintu. (6.2.16) 

It appears that the observed value for J is very small, since the maximal value 
would be ^ 0.1, whereas in Nature ^ 

J = 3.05+J]i|^ X 10"^ (6.2.17) 

This discrepancy in 3.5 orders of magnitude is the fine-tuning problem we are inves- 
tigating in this chapter. 

In a general discussion where the values of the quark masses are not fixed, J is 
not an appropriate measure of CP violation, since even with nonvanishing J one 
could have CP conservation if, for example, the up and charm quark masses were 
coinciding. It was suggested in fSD] to use an appropriately normalised form of det C, 
namely 

= ^^ (TrC^^3/2 (6-2-18) 

for three quark families as the unique basis independent measure of CP violation. 
This is a dimensionless number which takes values between —1 and +1, and is again 
observed to be very close to zero. When written out in terms of the CKM matrix 
parameters and quark masses, it is a rather complicated expression which is therefore 
not extremely useful in practical computations. In the present analysis, we assume 
the quark masses as known and regard J as the measure of CP violation. 
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6.3 Statistics on the Space of CKM Matrices 

Let us compute expectation values for J for different choices of measure on the double 
quotient f/(l)^\S'f/(3)/t/(l)^, parametrised by coordinates {x,y,z,w). Obviously, 
given a metric g, the expectation value of any function f{x,y,z,w) is 



As we mentioned in chapter one possible viewpoint of a given function on 
U{l)^\SU{3)/U{iy is to see it as function on the homogeneous space S'f/(3)/f/(l)^ 
which is independent of the coordinates parametrising the left f/(l)^ {p and q in 
our conventions). Since we also saw that there is a unique left-invariant measure on 
the homogeneous space, this gives a unique definition of expectation values of such 
functions. 

We computed this left-invariant measure on SU{3)/U{1)^ to be 



for some constant A^. 

Note that there is no need to pass from the double quotient to the homogeneous 
space if one regards the former as the fundamental domain of definition of a given 
function. One might equally well regard the left-invariance on the homogeneous space 
that has led to a unique measure as a spurious symmetry requirement. 

Remembering that the CKM matrix is defined as V = [/{U'Y in terms of two uni- 
tary matrices U and U' (denoted by Ul and f/^ for the general case of non-Hermitian 
mass matrices above), one might think about defining the Jarlskog invariant J in 
terms of coordinates on the Cartesian product of SU{3) x SU{3), the spaces of U 
and f/'li The natural measure on SU{3) x SU{3) is, in terms of left-invariant forms 
ai and a'^, 

^ = ai A ^2 A as A ^4 A as A as A ^7 A ag A o-'i A A 0-3 A 0-4 A 0-5 A ctq A 0-7 A CTg . (6.3.3) 

Since it is only V = UU''^ that enters into the CP violating parameters, one could 
consider U and V as independent variables, i.e. write U' = V'^U for some matrix V. 

^Ambiguities in the definition of U and U' mean tliat tfiey sfiould be regarded as living in quotient 
spaces sucfi as f/(l)^\5'C/(3), but since our left-invariant measures do not depend on the left and 
right C/(l)^ parameters such subtleties may be ignored. 



J dxdy dz dw ^/det g /(x, y, z, w) 



(6.3.1) 



J dx dy dz dw y/det g 



a/^ = A^sin(2x) sin(22;) siny cos^y 



(6.3.2) 
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Then the Maurer-Cartan form on the second SU (3) is 

i AX = U'^dU' = UUU - U\dV V^)U , (6.3.4) 

which gives a'^ = aa — habfb, where Tb are right- invariant forms on SU{3) in terms 
of V coordinates and hab only depends on the U coordinates. The measure fl6.3.3p . 
expressed in terms of V and U coordinates, is thus a product of a function of the 
U coordinates and the natural measure in V coordinates (left- and right-invariant 
forms on SU{3) give the same measure). Integration over the U coordinates then 
just gives an irrelevant constant, and one is left with the measure (16.3.2p on the 
space of V matrices. This justifies the use of (16.3.21) instead of the more complicated 
constructions obtained by reducing to the double quotient, and we will regard (I6.3.2p 
as the most natural choice of measure on the parameter space. 

For the measure (I6.3.2p the evaluation of the necessary integrals is very simple 
and we find that all odd powers of J average to zero, and 

( J2) = J-^ 1.389 X 10"^ , (J^) = — - — ^ 4.960 x 10"^ . (6.3.5) 
^ ^ 720 ' \ / 201600 ^ ^ 

Thus we find that AJ for the Jarlskog invariant is given by 

A J = ^ 0.0373 , (6.3.6) 

12^5 

which is about three orders of magnitude larger than the experimental value (16.2. 17p . 

We argued in section 15.31 that a geometrically motivated measure on the double 
quotient space arises from Kaluza-Klein reduction of a left-invariant metric on SU{3) 
or SU{3)/U{1)'^, and obtained the measure (I5.3.7P from the bi-invariant metric on 
SU{3). It is interesting how expectation values for powers of J compare to the 
previous calculations if this measure is used. Unfortunately, the expression for the 
measure is too complicated to allow us to perform the integrations analytically. Using 
numerical integration, we find that 

(J2) ~ 1.1161 X 10~^ (J^) ^ 3.750 X 10"% (6.3.7) 

with the odd powers of J again averaging to zero. Thus we find 

A J ^ /(J2y ^ 0.03341 , (6.3.8) 



which is very close to the previous result. 
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Naively, one might have thought that since J is independent of all the U{1) phases, 
the results would be the same whether one averaged over the space U{lY\SU (3)/f/(l)^, 
or else the flag manifold S'f/(3)/?7(l)^. Of course we know that this is not correct, 
since, as we have seen, the measure for the biquotient takes a different form than 
the measure one would obtain from just integrating over the left f/(l)^ parameters. 
Nevertheless, this does not seem to have a great impact on expectation values for J. 

We briefly mentioned in chapter E] that one could construct more general mea- 
sures on the double quotient by Kaluza-Klein reduction of more general left-invariant 
metrics on S'f/(3)/[/(l)^. While this makes the expression for the measure even more 
complicated, the effect on expectation values of J seems somewhat limited, unless 
one takes extreme values for the "squashing parameters". The general conclusion, 
namely that there appears to be a fine-tuning problem, is unchanged]^ 

Given that we argued that the measure used by Ozsvath and Schiicking (15.3. ip was 
not appropriate from a geometric viewpoint, we should compute expectation values 
for this measure also. We find that again (J) = and 

(J2) = 35 X 2-^6 ^ 5.341 X 10"^ , (J^) = 27027 x 2"^^ ^ 1.573 x 10"'^ , (6.3.9) 

and that the standard deviation is 



AJ2 = ^{J^) - (J2)2 = V22127 X 2"^^ ^ 1.135 x 10"^ , (6.3.10) 

and 



\/35 

AJ = A/(J2y = ^ ~ 0.02311 . (6.3.11) 
256 

Again, the results are rather similar to the previous cases. 

Assuming a uniform distribution over the angles, and hence treating the double 



coset as a fiat four-dimensional manifold so that the measure is simply y/det g = 1, 
would give 

(J) = 0, (J2) = ^— ^ 4.883 X 10"^ , (J^) = 189 x 2'^^ ^ 1.408 x 10"^ . 



2048 

Hence, this simplest possible choice gives 



(6.3.12) 



AJ?« 0.02210. (6.3.13) 



^Tliese calculations, mainly carried out by Chris Pope, are not included in detail in this thesis, 
but can be found in [57] . 
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We have to conclude that, although we argued that some choices of measure on 
the space of CKM matrices are better motivated geometrically than others, if one 
is only interested in predictions about the magnitude of J there seems to be little 
difference between all of the choices we have examined. In all cases one would expect 
the observed value of J to be at least of order 10~^, which leaves three orders of 
magnitude between prediction and observation. 

6.3.1 Fine-tuning of J 

In the previous subsection we saw that different measures on the space of mixing 
angles all seem to lead to expectation values for J which are about three orders of 
magnitude larger than the observed value. The value for J that we observe hence 
appears to be finely tuned. In this section we shall do a closer, mainly numerical, 
analysis of the fine-tuning involved. We compare results obtained by taking the 
S'[/(3)-invariant (descending from the left-invariant measure on S'f/(3)/t/(l)^) and 
Kaluza-Klein (obtained from Kaluza-Klein reduction of the bi-invariant metric on 
5'f/(3)) measures, which seem natural from a geometric perspective, with a uniform 
distribution which is just the simplest possible choice. 
The observed value for the Jarlskog invariant J is 

J ^ 10-^-51 (6.3.14) 

In order to obtain a probability distribution for J we have used Mathematica to 
numerically compute integrals of the form 

J dx dy dz dw \/ det g 9{a — \J\) 9{\J\ — b) = P{h < | J| < a) ■ j dxdydzdw a/ det g 

(6.3.15) 

using Monte Carlo methods. The 5'?7(3)-invariant measure and the Kaluza-Klein 
measure disfavour small values of J more strongly than a uniform distribution would. 
For example, we obtain 

Pfiagd J| < 10"') ^ 0.25% , Pkk(|^| < 10-') ^ 0.44% . (6.3.16) 

Taking a uniform distribution a/ det g = 1, we get 

Punif(|J| < 10-') ^7%. (6.3.17) 

In figures [6711 to 16 . 3 [ we have plotted the probability distribution of \ J\ logarithmically, 
together with a fit to a power law p(|t/|) oc \ J\^ assumed to be valid for small \ J\. 



Naturalness of CP Violation 



133 



p(log(|/|)) 




Figure 6.1: Probability distribution for log|J| using the S'C/(3)-invariant flag measure. 

The degree of fine-tuning required to reproduce a very small J is considerably higher 
if one uses the measure induced by a 5'[/(3)-invariant flag metric or the Kaluza-Klein 
metric, maybe contrary to what one might expect. Values of J close to its maximal 
value of ~ 0.0962 are disfavoured in both cases. This motivated the use of a 
logarithmic scale for \J\. 

p(log(|/|)) 




Figure 6.2: Probability distribution for log \J\ using the Kaluza-Klein measure. 

In all three cases the numerical results for small \J\ are well approximated by a power 
law of the form = a ■ \ J\^ for the probability density of \ J\. The logarithmic 

graphs show p(log | J|) oc For the S'?7(3)-invariant flag measure (Fig. 16. ip . the 

best fit to the data in the region below \ J\ = 10^^'^ or log \J\ = —5.3 is 



Afiag = -0.042(±0.006) , «flag = 18.1(±0.7) ; 



(6.3.18) 
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for the Kaluza-Klein measure (Fig. 16. 2p we fitted the data in the region below \ J\ = 
10"^'^ or log I J| = —6.2 and obtained 

Akk = -0.097(±0.008) , okk = 18.9(±1.0) ; (6.3.19) 

finally for the uniform measure (Fig. 16. 3p . the best fit to the data in the region below 
|J| = 10-3-4 or log|J| = -7.8 is 

Aunif = -0.500(±0.005) , Ounif = 3.51(±0.15). (6.3.20) 



p(log(|/|)) 




Figure 6.3: Probability distribution for log \J\ using a uniform distribution. 



6.3.2 Wolfenstein Parametrisation 

A different parametrisation of the CKM matrix which is frequently used in the particle 
physics literature was introduced by Wolfenstein and is based on the experimentally 
observed hierarchy 

y<^x<^z<^l (6.3.21) 
in the mixing angles. One rewrites |169] 

sin^ = A , sina; = AX"^ , sin ye"™ = AX^{p - ir/) (6.3.22) 

and treats A as a small parameter while A, p, and rj are supposed to be parameters 
of order unity. In the modern literature one also frequently uses p, fj instead of p and 
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T] because then the combination p + zr/ is independent of the phase convention in the 
CKM matrix [7]. These parameters are defined by 

'1- p- A^X^p' + ff) 



P 



1 - A2 (1 - A2A4p)2 + A^X^f]^ ' 



''"V 1-A2 (1 - A2A4p)2 + A^AV ■ ^^'^'^^^ 
The experimental values for X,A,p,f] are [zjl 

A = 0.2272 ± 0.0010, A = O.SlStofil, p = 0.221^°;°^^, f] = OMOtofi ■ (6.3.24) 

One viewpoint on the Wolfenstein parametrisation is that it is adapted to the values 
for the CKM matrix entries that we observe and has no deeper significance; but often 
the viewpoint is expressed that this parametrisation expresses some kind of "natural 
hierarchy" in the mixing angles coming from physics beyond the standard model (see 
£■9- [21] )• Treating the other parameters as "naturally of order unity" reduces our 
calculations to a one-dimensional problem as everything is only expanded in terms of 
A. We find that the S'[/(3)-invariant measure on the fiag manifold is now, to leading 
order in A, 

§|^[V««A3A"(1.A'^.0(A*)). (6.3.25) 

and the Jarlskog invariant J is 

A^f]X^{l - A^X^) (1 - A^ - 2A^pX^ - A^{ff + (p - 2)p)X'^ + A\ff + p2)A8) 

(1 - A2) (1 - 2A2pA4 + ^4(7^2 + p2)A8)2 

A^f]X^ + 0{X^^) . (6.3.26) 



J 



Inverting this expression to leading order gives the probability distribution for J 

J 




(6.3.27) 

which is incompatible with the numerical results. Trying to improve this approximate 
result by letting A, p and fj take all possible values leads to inconsistencies since the 
expansion in powers of J contains poles of arbitrary order in A. From our present 
viewpoint, where no mechanism for fixing these parameters close to one is known, 
the Wolfenstein parametrisation seems rather misleading when discussing geometric 
probability. 



"Note that only even powers of A appear in all expansions, so that it is « 0.05 which is the 
small parameter. 
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6.4 Statistics on the Space of Mass Matrices 

In the previous sections we have focussed on ?7(l)^\S'?7(3)/f/(l)^, the space of CKM 
matrices, as the space of CP violating parameters. Since S'f/(3) is compact, this 
space has finite volume for a natural measure. But the CKM matrix is derived from 
the quark mass matrices, which could be viewed as more fundamental and more 
directly determined by physics beyond the standard model. In this section, we try 
to obtain statistics of the Jarlskog invariant J from a random distribution on the 
space of mass matrices; in the standard model, this is the space of 3 x 3 Hermitian 
matrices, whereas in certain extensions with left-right symmetry one has to take all 
3x3 complex matrices into account. 

In this section, as suggested in [89], we define a dimensionless mass matrix M by 
M = m/A, where m is the quark mass matrix in the notation of section [6^ and A 
is a scale which may be chosen for convenience. A natural choice would be A = 
for the up-type or A' = for the down-type quarks, but since we in principle allow 
arbitrary values for the quark masses we leave A arbitrary. 

We now need to pick a measure on the space of mass matrices in order to do 
statistics. Let us summarise the main results of section 15. 4t For a given Hermitian 
matrix M, one defines a unitary matrix U and a real diagonal matrix D by 

M^ = U^DU, (6.4.1) 

and the natural measure, invariant under conjugation by U{3), on the space of such 
matrices was found to be 

DMn := {Di - D2f{Di - D:if{D2 - Ds)^ sin(2a;) cos^y sin?/ sin(2z) 

^dDidD2dD^dx dy dz dw dr dt . (6.4.2) 

We noted that to avoid overcounting one should integrate over times a quotient 
of SUiK), more precisely {U{lf x &^)\SU{'i). 

Let us contrast this with the space of all complex matrices; here we used the 
representation 

M, = uIdUr, (6.4.3) 

and identified the relevant space one has to integrate over as M.^ x {U{1)^ x &3)\{U (3) x 
U{3)). We then argued that if we are only interested in functions that depend on Ul, 
but not on Ur, such as the Jarlskog invariant J, we can integrate over the space of 
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Ur. Then the resulting measure on x (t/(l)^ x ©3)\5't/(3) is 

DM^ := W{Dl - D]f\DiD2D^\ sin(2x) cos^y siny sin(22) 

xdDidD2dD^dx dy dz dw dr dt , (6.4.4) 

which obviously differs from (16.4.21) by different powers of the real numbers Di that 
had the role of eigenvalues for Hermitian matrices. Since the measure (16.4.41) is 
invariant under Di — t- — -Dj, we will integrate over all of for simplicity. This only 
leads to an extra factor 8 which drops out in expectation values. 

It is clear that in both cases the integrals over 'E? with the given measures will 
diverge. We could introduce a cutoff for the quark masses, but then any expectation 
values for quark masses would strongly contradict observation, as there is no way to 
explain the observed mass hierarchy. 

To both make the integrals converge and introduce a possibility for introducing the 
quark masses as additional parameters into our distribution, we introduce a function 
decaying sufficiently fast for large \Di\ into the measure. A natural and simple choice 
is a Gaussian. 

Since there are actually two integrations over the space of mass matrices, corre- 
sponding to two mass matrices M and M' for the up-type and down-type quarks, we 
will use the measur 

DM DM' exp(-Tr(MMU) - Tr(M'(M')U')) 
= DM DM' exp(-Tr(f/"fD2f/A) - Ti{{U')\D'fU'j^)) , (6.4.5) 

which for complex matrices is still invariant under the two right U (3) actions on the 
complex matrices M and M', but the invariance under the left f/(3) actions is broken 
to the subgroups commuting with A and A' respectively. We will assume A and A' 
to be Hermitian with non-negative eigenvalues, satisfying [A, A] = and use the 
redefinitions U — )■ UW, U' — )■ U'W, which leave J invariant, to diagonalise A and A'. 
Hence we can assume A and A' to be diagonal with positive entries in the following, 
and the subgroup commuting with A and A' is f/(l)^. For Hermitian matrices, (I6.4.5P 
becomes 

DMu DM^ exp(-Tr(M2 A)) exp(-Tr((M^)2A)) , (6.4.6) 

** DAI is to replaced by DMn and DMc for Hermitian and complex matrices respectively, and from 
now on we write U instead of Ul in the case of complex matrices as well, since Ur has disappeared 
from our calculations. 
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where the Gaussian breaks the symmetry of the measure from invariance under 
(?7(3) X ?7(3)) to invariance under {U{lY x ?7(1)^), again the subgroup commuting 
with A and A' . 

We will find that this symmetry breaking is necessary to obtain a distribution 
that can reproduce the observed quark masses. Our proposal is to fit the diagonal 
matrices A and A' to the observed quark masses and to use the resulting probability 
distribution for statistics of J. 

An expectation value of a quantity such as is then given bjlll 

(J2) = N j DMDM'e-^'^^'^''^^-'^'^^^'^'"^'^j\M,M') (6.4.7) 

= n[ dDdD' [ jy^jy^>^-TriD^UAU^)-TriiDrU'A'U'^)j2_ 

Here DU and DU' are the measures on {U{iy x e3)\SU{3), given by f!5.1.12p . 

DU := sin(2a;) siny cos^ ?/ sin(2z) , (6.4.8) 

and 



dD :-- 



{Di - D2Y{Di - Dsf{D2 - D^Y dDi dD2 dD^ , M Hermitian , 

(Dj - Dlf{Dl - Dlf{Dl - Dlf\DiD2Dz\ dD^ dD2 dD^ , M complex , 

(6.4.9) 

and the normalisation factor is defined by 



1:=/ dDdD' f jjjjjjjj^^-TriD^UAU^)-Tri(DrU'A'U'^)_ ^g4 10) 

From J = 3m{Vu V22 V^2 ^2*1) and V = UU'\ we have 

3 

J{U,U')= MUlaU2hUlU;^U'\jj'*2,U'2cU\d). (6.4.11) 



a,b,cA=l 



At this point, it is perhaps instructive to note that choosing A proportional to the 
identity would split the integral f l6.4.7p into a product of an integral over M^, i.e. 
the quark masses, which just gives a constant, and an integral of over ((f/(l)^ x 
<5^)\SU . Since all even powers of J are invariant under the 63 action on U and 
[/', this can be replaced by an integral over (f/(l)^\S'f/(3))^ if averages are concerned. 
By the arguments presented in section 16.31 a change of coordinates reduces this to a 



^^^In all cases considered, all odd powers of J again have expectation value zero. 
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single integration over a copy of 5'f/(3)/f/(l)^, and one recovers the results of section 



16.31 for expectation values of powers of J . The introduction of more general diagonal 
matrices A and A' means that the maximal invariance of the measure DM DM' is 
broken down to the action of a subgroup. 

It should be clear from (16.4.71) that multiplying A (or A') by a constant is the 
same as rescaling the eigenvalues Di (or D'^) and so amounts to a rescaling of A (or 
A'). We can therefore, without any loss of generality, choose 

/lo o\ /in n\ 



A 





l/iil 

yo I / III I 








(6.4.12) 



where fic, fJ^u, fJ's, and fid are dimensionless parameters that we are free to choose so 
as to reproduce the observed quark masses as expectation values. (In the case of an 
exponential exp(— Tr(D^74)), these would of course be equal to the respective quark 
masses, expressed in units where A = rrit and A' = rrib-) Because of experimental un- 
certainties in the up and quark masses, one can modify this distribution to reproduce 
different values for these masses. 



6.4.1 Hermitian Mass Matrices 

We shall now present the result of evaluating the integral (16.4.71) . first for Hermitian 
and then for complex mass matrices, and therefore for the two different choices of 
measure (16.4.91) . We will have to use various approximations which we try to justify in 
the following, and the final result is obtained independently by numerical integration 
and by an analytical approximation, where we find relatively good agreement between 
the two. 

The need for approximations in the integral (16.4. 7p is immediate since the expres- 
sion for J in terms of coordinates on ((f/(l)^ x &3)\SU{3)y is too complicated to be 
given explicitly. However, since 

Tr{D'UAU^) = 5^/^^$^ A|f/.,r =: E^'^- Tr((D')'^'^'^'') =: 

a c a a 

(6.4.13) 

with 

= Ai cos^ y cos^ z + A2 cos^ y sin^ z + A^ sin^ y , 
^[ = A[ cos^ y' cos^ z' + A'^ cos^ y' sin^ z' + A3 sin^ y' , (6.4. 14) 
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and we assume ^3 ^ 1 and A3 ^ 1 , the integrand is negligibly small unless y ~ and 
~ 0. We use this to approximate the integrals over y and y' (recall the restriction 
of the coordinates on U {lY\SU {?>) given by fl5.4.9l) which restricts from f/(l)^\S'f/(3) 
to {U{lf X 63)\5f/(3)): 

arctan(sin x) arctan(sin a;') 



/ 









00 00 



\y=y'=0 




1 



-Ti{D^UAU^)-Tr{{D')^U'A'U'^) j2 



r{U,U'))\,^^. (6.4.15) 



It turns out that this is independent of w and w'. Constant prefactors such as 
1/4743^43 appearing in both numerator and denominator can be dropped, and so we 
have 

^'^^'^ J^,dDdD'Jd^xd^x'S2,S2,S2,'S2,' (e-Tr(D2C/AC/t)-Tr((D')^C/M'C7't)) I 

where S2x = sin2x etc. and 



\y=y'=0 

(6.4.16) 



7r/4 7r/2 2tt 2it 

d^x = dx dz dr dt (6.4.17) 



0000 



and similarly for J d'^x' . 

Now we can integrate over both copies of in (16.4. 16p , using 



00 00 



j dD,J dD2 j dD, (Dl-D2)'(/^l-I^3)'(^2-/^3)V«^^^«^^^««^i 



-00 —00 —00 



= ,,5/2,5/2,5/2 + ^3) + ^2(^1 + ^3) + ^1(6 + - 2^16^3) . (6.4.18) 

?2 ^3 

The explicit expression for J at y = = is 

J{U, U')\.^^ = \s2.S2.' {cl.s\s,, sin(3f + t) + c^c,.^ sin(3f - t) (6.4.19) 



\y=yl=Q ^ 

~(?z^zSz'{(?z' [sin(3f + t ) + sin(3f — 3t)] — s^/ sin(3f + t)) 
+CzCzis\[(?^i sin(3f — t ) — s^/ [sin(3f + 3t) + sin(3f — ^)] )} 
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where Sx = sinx, c^/ = cosz', etc., f = r — r', and i = t — t'. Integrating (16.4. 19[) over 
r, r', t, and t' indeed gives zero, which is why we choose to use J^. 

We need to determine the parameters appearing in the matrices A and A' in 
(16.4. 12p by fitting them to the expectation values for quark masses that we want to 
reproduce. We first observe that expectation values for squared mass matrices take 
the relatively simple form 

7r/4 tt/2 



J^.dDDl J dx J dz sin2xsin2z fe-^'^'^'^^^'A , 
{DD ^ ^ ^ —— . (6.4.20) 

^ 1' 7r/4 Tv/2 ^ ' 

J^.dD J dx J dz sin 2a; sin 2z (g-TrCi^^t^Af/t)) | 





The denominator of f l6.4.20p is explicitly 

7r/4 tt/2 

/• , /• , 37r3/2 
Id '■= dx a^; sin 2x sin 2z — - x 

(^1 (6 + 6) + ^2(6 + ^3) + + 6) - 2666) , (6.4.21) 
where within our approximation the quantities 6 are taken at y = 0, 

6 = ^1 cos^ z + A2 sin^ 2; , 6 = ^1 cos^ x sin^ 2; + cos^ x cos^ z + A^ sin^ x , 

6 = ^1 sin^ X sin^ + sin^ a; cos^ z + A3 cos^ a; , (6.4.22) 

with ^3 3> ^2 S> Ai. We notice that all 6 ci^^^ nonzero for all values of x and z. 
Furthermore, the integral is dominated by very small x and z (we cannot have x = |), 
and we can approximate Id well by only keeping the terms of leading order in x and 
z in the trigonometric functions, and 

6'(6 + 6) + 6'(6 + 6) + 6'(6 + 6) - 2666 ^ ^3^' + ^3^2 , (6.4.23) 



which are the leading terms (as we shall see, the first of these is effectively also of 
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order A\A2): 

7r/4 7r/2 



37r3/2 



j dx j dzAxz {A^x^ + AlA2){Ai + A2z')~'''^{A2 + A^x^)-'''^A~''/^ 



oo oo 

^jdX I dZ{AlX + AlA,)iA, + A,Z)-'/\A2 + AsX)-'/'Af/' 



oo 

Svr^/' r .... .o ,.._./o 2 



y dX{AlX + AlA2)iA2 + A3X)-5/2^3 5/2 _ 



3Af^A2 



37r3/2 2 



oo 

3/2 , ,^2 + ldX^^{A, + A,X)-''' 





7r3/2 /2 1/9 ,/9 4 \ 7r3/2 
' -A, ^/^A- + -^7^ = -w-w-^ . (6.4.24) 



Similarly, we find for the numerator of (I6.4.20p 

157r3/2 



oo oo 



In{Dl) ^ jdX j dZ{AlX + AlA,){A, + A2ZY"\A, + A.xyi^A-,'/' 



oo 

^^""'^^ 1 rfX (A^X + AlA2){A2 + AaX)-^/^^- • ^ 

(6.4.25) 



^ 
37r3/2 



4Af A^/^A^/^ ' 



hence 



{DD ^ ^ . (6.4.26) 



Redoing the same calculation for D2 and gives 

There is a relative factor of 3 which has to be taken into account when determining 
A and A. 

Because of the dependence of masses on the energy scale in quantum field theory, 
described by the renormalisation group, there is some ambiguity in what is meant by 
the "quark masses" we want to reproduce. Following [144] . for example, we take all 
the quark masses evolved to the scale of the Z boson mass. These are given in |170] : 

(m„, m^, nit) = (1.27t[!i^ MeV, 0.619 ± 0.084 GeV, 171.7 ± 3.0 GeV) ; 

{md,ms,mb) = (2.90;};?^ MeV, 55^}^ MeV, 2.89 ±0.09 GeV) . (6.4.28) 
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We use the central values 

(m„,mc,mt) := (1.27 MeV, 0.619 GeV, 171.7 GeV) ; 

(mrf, m„ mb) := (2.9 MeV, 55 MeV, 2.89 GeV) . (6.4.29) 

The mass scales A and A' that were so far arbitrary are fixed by setting {DD = 
(m^/A)^ and {{D'^)"^) = {rrih/A')'^. By comparing the results obtained by numerical 
integration with the values we want to reproduce, we can then fix the parameters 
/ic,/i«,/is and /Xrf. 

In the case of the positively charged top, charm and up quarks, which exhibit 
a more extreme quark mass hierarchy, we find that numerical calculations (using 
Mathematica) reproduce the results we have obtained analytically very well (see 
Table [6?T1) . For the negatively charged quarks, we find numerically that we have to 
use relative factors different from 3 to reproduce the observed masses. Comparing 
the numerical results with 06.4.291) . we fix the parameters appearing in A and A' to 

/ \ 2 / \ 2 



Q / N 2 

6 I 



^ 3.90 X 10-^ 








^ 5.43 X 10-\ 


/^d = 


12 







-10 



. ~ 5.43 X 10-4 , ^l= — l'^\^ 2.42 x 10"^ . (6.4.30) 

2 \mbj 5 \mbj 

As a brief side remark, we see that the dominant contributions to these integrals 
come from the regions 

[A^ [M 

(6.4.31) 

and these values are all small compared to one. This observation allows us to give 
rough estimates for magnitudes of individual elements of the CKM matrix. 

In the standard convention the ordering of the quark families is [u, c, t) and not 
{t,c,u) as used in f l6.4.12p . which means that in our parametrisation, 

\{UU'^)u\ = \Vt,\ , \{UU'^)i2\ = \Vts\ , \{UU'^)23\ = \VJ . (6.4.32) 

Since all of the numbers in fl6.4.3ip are small, we only keep leading terms in the angles 
on U and U': 

\{UU'^)is\ ~ \x'{z' -z)- e™'?/' + •••!- x'z' ^fid^2x lO'^ , 

I(f/f/'"^)i2| -0.02, |(f/f/'"^)23| ^x' ^ — ^ 0.07. (6.4.33) 
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Experimental values are [7] 

\Vtd\ = (8.14l°:32) X 10-3 , \Vts\ = (41.6ll°:^2) ^ ^ i^^^i ^ 0.2271^°:°°}° . 

(6.4.34) 

Our rough estimates reproduce the right ordering of the three parameters and are 
accurate to factors of order a few. A more careful analysis would involve computing 
expectation values for these parameters in the distribution we have assumed. 

We return to the task of computing the expectation value of J^. In order to obtain 
an analytical expression, we use the fact that the main contribution to the integral 
(16.4. 16p will come from small z to only take the term in (16.4.191) that is nonzero at 
z = 0. Averaging over r, t, r',and t' gives a factor of 1/2, as one might have expected, 
and therefore we use 

t2 -'-•2 2 -2 / 2 / 2/-4/ /r A or:.\ 

•^smaii2 •= 2^^^ a; COS xsm x cos x cos z sm z (6.4.35) 

for our calculations. The numerator of (16.4. 16p is the product (using again that only 
small z contributes) 

7r/2 7r/2 

Qtt^ [ 2z f , ?,m2z' cos^ z' sin^ z' 

J IA^TA^zW J {A,cos^z' + A,sin'zr/' ^ ^ ^ ^ 



7r/4 

^ sin 2x sin^ x cos^ x{Al cos^ x sin^ x + A'^A2{cos^ x + 2 cos^ x sin^ x + sin^ x)) 
{A2 cos^ X + A3 sin^ a;)^/^(A3 cos^ x + A2 sin^ x)^/^ 







times an integral identical to the second line of (16.4.361) . except that A2 and A3 
are replaced by A'^, and AL The first two factors are — — and — y= — 7^= — 
respectively; for the second line of (I6.4.36P we change variables to X = cos^ x to 
obtain 
1 



1/2 



3 



{A2X + Asil - X))5/2(A3X + ^2(1 - X))5/2 




(6.4.37) 
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where we are dropping corrections of order Putting everything together, we obtain 



xj^arctan^-^ ,6.4.38) 

ruu rud rUb ( rrir 2m, 

arctan 



15 



2 

3 rrta 



5 1 32 niii 





X I arctan ■, , 

32 TTid V 5 

where the numerical factors appearing in the last line come from the different factors 
chosen in f l6.4.30p . Note that the top quark mass does not appear in this approximate 
result. 

We also use numerical integration to check the validity of our analytical approxi- 
mations. In these calculations we use both the simplified expression J^^^ji ^ and the 
expression for J given in (16.4. 19p . We find that for the first quantity, the numerically 
evaluated expectation value (^smaii z) about 7/6 of (I6.4.38p . and the numerical result 
for (J^) (taken at y = y' = 0) is 

(J2) ^ 5.28 X 10-^ (6.4.39) 

which gives 

A J = /(J2y ~ 7.27 X 10"^ (6.4.40) 

which is much closer to the observed value than any of the previously obtained results. 
Assuming a Gaussian distribution for J which is peaked at zero, the probability of 
finding a small J, in the sense of Sec. 16.3. H is now 

Pmass(|J|<10-')~83%, (6.4.41) 

whereas the probability of finding a J which is even smaller than the observed value 
is 

Pmassd^l < 3 X 10-^) ^ 32% . (6.4.42) 

The observed value for J can no longer be viewed as being finely tuned if the distri- 
bution used in our calculations is assumed. 
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Table 6.1: Analytical and numerical results for integrals of interest. 





J. U \\JV\^L Uj ^ ( 








A Ti PI 1 vf ip/^ 1 ypmi If 


1.43 X 10~^^ 


2.14 X 10^21 


2 78 X 10-26 


1.17 X 10-31 


Numerical result 


1.43 X 10-21 


2.14 X 10-21 


2.78 X 10-26 


1.18 X 10-31 


Quantity 


(over x', z') 








Analytical result 


1.32 X 10-13 


1.99 X 10-13 


3.60 X 10-1^ 


1.60 X 10-19 


Numerical result 


1.32 X 10^13 


1.98 X 10^13 


7.28 X 10^1^ 


1.99 X 10-19 


Quantity 


Id (over x, z, x', z') 


( P ) 

\ small z/ 






Analytical result 


1.89 X 10-34 


3.22 X 10-^ 






Numerical result 


1.88 X 10-34 


3.73 X 10-9 


5.28 X 10-9 





To test the sensitivity of our results to changes in the parameters, we take values 
at the upper or lower limit in fl6.4.28p and try to find the highest and lowest values 
for (J2). We find that setting 

(m„,mc,m() := (0.85 MeV, 0.535 GeV, 174.7 GeV) ; 

(md, m^, mb) := (1.71 MeV, 40 MeV, 2.98 GeV) (6.4.43) 



gives 



and 



whereas setting 



gives 



and 



(J2) ^ 1.86 X 10-9 (6.4.44) 

AJ = v/(J2y ^ 4.31 X 10-^ (6.4.45) 

(m„,mc,mf) := (1.77 MeV, 0.535 GeV, 168.7 GeV) ; 

(mrf,m„mb) := (4.14 MeV, 71 MeV, 2.8 GeV) (6.4.46) 

(J2) ~ 1.52 X 10"^ (6.4.47) 



AJ= /(J2y^ 1.23 X 10-^ (6.4.48) 

While the result for A J depends on the choice of parameters, we see that the general 
conclusion, which is a comparison of orders of magnitude, is unchanged by reasonable 
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modifications of tlie parameters [i.e. values for tfie quark masses). Even tlie greatest 
possible value for A J is significantly lower than any of the values obtained in previous 
sections. 

In this subsection, we have established that assuming the observed hierarchy in 
quark masses in a Gaussian distribution over the space of mass matrices gives expec- 
tation values for which are small enough to regard the observed value as "natural" 
and not finely tuned. This statistical observation seems to open up the possibility 
that the same mechanism that is responsible for the apparently unlikely hierarchy in 
quark masses might also explain why the observed value for \ J\ is so small. 

A more detailed analysis including a probability density for \J\ for this distribution 
is left to future work, since the numerical methods used here do not give sufficiently 
accurate results. 

6.4.2 Complex Mass Matrices 

The corresponding calculation for general complex mass matrices differs from the 
one for Hermitian mass matrices detailed in the previous subsection by a different 
measure over M®, the space of possible quark masses. Hence after the integration 
over has been performed, the resulting integrands will be different functions of 
the quantities introduced in the previous section. The steps of calculating an 
expectation value for will be the same as in the previous subsection, and we will 
use similar analytical approximations as well. It suffices therefore to highlight those 
steps where the resulting formulae differ from the previous ones. 

In the previous subsection we concluded that a natural measure on the space 
of Hermitian mass matrices that involves the observed quark masses reproduces the 
observed magnitude oi CP violation very well. The quark mass hierarchy had the 
effect of significantly lowering the expectation value of J^. Looking at the different 
measures fl6.4.9p . we would expect this effect much stronger if general complex mass 
matrices are considered. This expectation will be confirmed: There now appears to 
be fine-tuning in the opposite way, i.e. one would expect CP violation to be much 
smaller than is observed. 

Using the same approximation of small y and y' as previously, we are again left 
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with computing an integral of the form 

4e dDdD' J d'xd'x' S^,S2.S2.,S2., (^g-Tr(D^C/AC/t)-Tr((D')^f/'A'f/'t) j2^^^ j 



y=y'=0 



dDdD' J d'xd^x' S2.S2.S2.,S2.^ ^^-T^iD^U AUr)-TriiD'YU' A'U'^)^^ |^^^^^^ 

(6.4.49) 

where 

7r/4 7r/2 2tt 2tt 

d'^x = dx dz dr dt (6.4.50) 





and similarly for J d'^x'. The symbol dD of course now has a different meaning. Again 
we can integrate over both copies of M.^ in fl6.4.49p . using 

oo 

/66€3 := / dD^dD,dD,{Dl-Dl)\Dl-Dl)\Dl-Dlf\D^D,D,\e-^^^'^-^^^^^^^^ 

— CO 

24 

= 757575 (2(e?e2' + + ClCim + + - - eiCs - 6^3) (6.4.51) 

-366^3(^1 + ^2 + il - & - el6 - - ili2 - ili^ - &) - seie'el) • 

To fix the parameters appearing in the matrices A and A', we again use expectation 
values for squared quark masses: 



J^sdDDj J dx J dz sin2xsin2;2(e-^(^'^^^')) 

i^'i) ^t^2 ■ (6-4-52) 

^3^^^ J dx J dz sin 2a; sin 2z (e-TrC^^^i/Ai/t)) |^^^ 





The denominator of f l6.4.52p is explicitly 

7r/4 n/2 



Id ■= J '^•'^ J dz sm2xsm2z fi^-^^^^.^ , (6.4.53) 



where ^1,^2,^3 at ?/ = were defined in (16.4.221) . Again, we can approximate Id well 
by only keeping the terms of leading order in x and z in the trigonometric functions, 
and approximating /^i^aCs by the leading term 

24 48 

" mm " {A + A,z^nA, + A,x^rA, ' 
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which leads to 



7r/4 tt/2 

48 



A 



j dx j dzAxz {Ai + A2z'^)'^ {A2 + A3X 



00 00 

^ JdX j dZ {Ai + A2Z)-\A2 + A^X)-^ 


48 1 1 6 



2\-3 



A3 AA2Ai 2A3AI A\AlAl 



(6.4.55) 



The result is very well reproduced by numerical calculations. Similarly, 

00 00 

Id{DI) ^ '^JdxJdZiA, + A2Z)-%A2 + AsX)-' 





240 1 1 24 



A3 5A2AI 2A3AI AIAIA3 



2 ' 



(6.4.56) 



hence 



{Dl) ^ i- . (6.4.57) 



Redoing the same calculation for D2 and D3 gives 

(Dl)^^^, {Dl)^^^. (6.4.58) 

The relative factors, which are now slightly different from the ones obtained in 
fl6.4.27p . have to be taken into account when determining A and A'. 
Recall that we used the following central values for the quark masses: 

(m„,mc,mt) := (1.27 MeV, 0.619 GeV, 171.7 GeV) 

(mrf, m„ mfe) := (2.9 MeV, 55 MeV, 2.89 GeV) . (6.4.59) 

We now proceed as before, using numerical calculations to check our analytical 
approximations. As before, the analytical approximations are better reproduced for 
the positively charged top, charm and up quarks, which exhibit a more extreme quark 
mass hierarchy, than for the negatively charged quarks. We use the numerical results 
to fix the parameters in A and A' to 

2 



u2 = 2 — ^ 2.60 X 10"^ , al = Ai — ] ^ 2.19 x 10^^° , 

Km J Km J 

2 



^il=(^] ^3.62xl0-^ ij,1 = a(^] ^4.03x10-^ (6.4.60) 
\mbj \mbj 
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In order to obtain an analytical expression for expectation value of J^, we again 
take the approximation of small z 

t2 ^ ■ 2 2-2 / 2 / 2/-4/ /'r A n\ 

^smaiiz ■= 2^^^ a; COS x sm X COS X COS z sm z (o.4.olj 

for our calculations. Within this approximation for J, still taking /^i^aCs ~ 48^]^ ^^2^ ^.^3"^, 
the numerator of (I6.4.49P is the product (using again that only small z contributes) 

tt/2 7r/2 

f . 2z /" , / sin 2z' cos^ sin"^ 
1152 X / dz-- — r-T777 X dz — K — — 

J (^1 + ^2^2)5/2 J {AiCOS^ z' + A2sin^ z')^ 



7r/4 

f , sin 2x sin^ x cos^ x 







(y42 cos^ X + ^3 sin x)'^(y43 cos^ X + ^2 sin x) 



times the integral in the second line with all quantities replaced by "primed" ones. 
The first two factors are j^^j^ and i2(A')2(^/-)3 ; respectively; for the other two (which 
have the same form) we change variables to X = cos^ x to obtain 
1 

{A2X + A,{l-XmAsX + A2il-X)) = 2ip; - a7 + ^ iiA^/A^f) ] ^ 



1/2 

putting everything together, we obtain 



(6.4.63) 



where the numerical factors appearing in the last line come from the different fac- 
tors chosen in f l6.4.60p . Just as before, the top quark mass does not appear in this 
approximation. This compares with the scaling behaviour obtained in the previous 
section, 

{JL.11.)- % • 6.4.65 

For numerical calculations we use both the simplified expression Js^iaiiz 
expression for J given in fl6.4.19p . We find that for the first quantity, the numerically 
evaluated expectation value, (Jsmaiiz) ~ 1-89 x 10~^^, is about 94% of fl6.4.64p . and 
the numerical result for (J^) (taken at y = y' = 0) is 

(J^) ^ 2.07 X 10-^^ (6.4.66) 

which gives 

AJ = /(J2y ~ 4.55 X 10"^ (6.4.67) 
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which is now almost three orders of magnitude smaller than the observed value. 
Assuming a Gaussian distribution peaked at zero, we now get 

P(|J| < 10-^) ^ 97%, (6.4.68) 

When the measure presented here is used, there seems to be extreme fine-tuning in 
J, but now we would say that one observes unnaturally large CP violation! This 
result may look surprising, given that the maximal value for J is around 0.1 and the 
observed value just 3 x 10"^, but it shows how strongly the quark mass hierarchy 
suppresses large values of J in our distribution. 



6.5 Extension to Neutrinos 

In the standard model, neutrinos are taken to be massless. There are no Yukawa 
couplings to the Higgs field, and no possibility of CP violation. However, since the 
phenomenon of neutrino oscillations has been confirmed experimentally, neutrinos 
must be assumed to have mass. Curiously, a mixing matrix for leptons, now referred 
to as the Maki-Nakagawa-Sakata (MNS) matrix, was already proposed in 1962 
|114] . It appears in a charged current that has the same form as fl6.2.7p for baryons, 
with massive leptons taking the place of up-type quarks and neutrinos taking the place 
of down-type quarks. With compelling evidence for neutrino oscillations [7J and hence 
massive neutrinos, such an interaction is now assumed to exist. By diagonalising the 
leptonic mass terms one will again encounter a mixing matrix M |95j . 

The mathematical problem is therefore the same as before; the only outstanding 
issue is the ambiguity in the definition of this matrix due to possible rephasing of the 
lepton fields. There are two different possible types of mass terms for fermions. In 
the most commonly used notation, fermion fields are given by complex 4-component 
(Dirac) spinors, consisting of two 2-component (Weyl) spinors: 

^= (^^'j, ^:=^V = (^| 4)- (6.5.1) 
Normally, a mass term contains and its Dirac conjugate ip, thus coupling ipi and 

-Cmass = mi)^) = m{iplipi + iplip2)- (6.5.2) 

In this case, the four complex components of ip represent a particle together with its 
antiparticle, which have the same mass but are otherwise distinct. It is however also 
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possible to have Majorana mass terms, which are of the form 

^CMajorana = mitpl^i + 1712^1^)2- (6.5.3) 

Here mi and m2 are not necessarily equal. In the case mi 7^ m2, the 4-spinor 
represents two different particles, which are each equal to their own antiparticle. It 
is then appropriate to take real and imaginary parts and rewrite the theory in terms 
of two real 4-component (Majorana) spinor fieldslH 

These fields, being real, have no rephasing freedom, and the matrix relating the 
flavour and mass eigenbases for neutrinos is in S'0(3). Therefore, for an MNS matrix 
which couples three Majorana neutrinos to the massive leptons, known to be Dirac 
spinors, there is only the freedom of rephasing on one side, and this matrix would be 
an element of the single quotient SU {?>) / U (l)"^ . Of course, all intermediate cases are 
also conceivable, where some neutrinos have Majorana and others have Dirac masses. 
The most general mixing matrix is hence of the form U{1)^\SU{N) /U{1)^^^ , for 
some k between and — 1. 

There are at present no strong constraints on the magnitude of CP violation from 
observation. The results presented in this chapter may possibly put constraints on it, 
if there is some experimental information about neutrino masses, or conversely make 
near-coincident neutrino masses appear more or less likely, if the magnitude of CP 
violation is known. 



6.5.1 Neutrino Mixing Matrix 

If we assume that the neutrinos are Majorana spinors, the lepton mixing matrix be- 
longs to U {lY\SU {?>) , since only phasing of the lepton charge eigenstates (z/g, z/^, i/,-), 
but not the mass eigenstates (i^i, V2, t's) is possible. The former are defined by 











= M 


V2 









In other words, \ua) is the neutrino state created in the decay + where l^, 

is an antilepton of flavour a = e, /i, r [7]. Such a state is in general a superposition of 

tt There is of course no need for a theory formulated in terms of Majorana spinors to have an even 
number of fields, since the Dirac spinor should not be regarded as fundamental. The presentation 
only started from the Dirac spinor field because it is most commonly encountered in the literature. 
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mass eigenstates which will have different propagation amplitudes, leading to nonzero 
probabilities for neutrino oscillation Va ^ ^p- 

One conventionally fixes the phases so that M takes the form 
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(6.5.5) 

where the three angles ^12, ^13, and ^23 lie in the first quadrant. 

The Jarlskog invariant for the neutrino mixing matrix is defined as in f l6.2.16p but 
with {x,y,z,w) now parametrising the MNS matrix M. Note, in particular, that it 
is independent of the phases cti and a2- 

Experimentally, parameters of the neutrino mixing matrix are not completely 
known. According to [7], 

sin2(2^i2) = 0.87 ± 0.03 , 0.92 < sin^(2e23) < 1 , sin2(2^i3) < 0.19 , (6.5.6) 

and there is no experimental information about the Dirac angle 6. Thus, we can cer- 
tainly deduce that there is an upper bound on the Jarlskog invariant for the neutrino 
mixing matrix, given by 

|J|< 0.049. (6.5.7) 

For six different neutrino mass eigenstates, as in the see-saw mechanism, which 
are assumed to all couple to charged leptons, a general mixing matrix would be an 
element of U{1)^\SU{6), since one would diagonalise a 6 x 6 Hermitian matrix. The 
present viewpoint is however that such additional neutrino fields would be "sterile" 
and not couple to either W or Z bosons [7]. 



6.5.2 Statistics of J 

We have seen that the parameter space for the neutrino mixing matrix is the six- 
dimensional single quotient f/(l)^\5'f/(3), and that the Jarlskog invariant for the 
neutrino mixing matrix takes the same form fl6.2.16p as it does for the CKM matrix. 
Therefore, all results obtained in section apply equally to the case of neutrinos. 



For completeness, we quote the results obtained in section 1631 for a right-invariant 
measure on the homogeneous space U{iy\SU{3), 

(J2) = — ^ 1.389 X 10"^ , (J^) = — i — ^ 4.960 x 10"^ , (6.5.8) 
^ ^ 720 ' \ / 201600 ' ^ ^ 
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and 

A J = ^ 0.0373 . (6.5.9) 

12^5 

This can be compared with the experimental bound given in fl6.5.7p . 

One should also repeat the calculations of section 16. 4[ assuming particular values 
for the neutrino masses (and including the known masses for the charged leptons). 
A strong hierarchy in the neutrino masses would then presumably again lead to 
"naturally" small CP violation from the corresponding mixing matrix. Alternatively, 
an experimental observation of small CP violation for neutrinos would perhaps be 
an indication of a mass hierarchy in neutrinos. At present, neither the magnitude 
of CP violation nor any values of neutrino masses have been measured sufficiently 
accurately to allow predictions. 



6.6 Summary and Outlook 

In this chapter, we analysed the problem of finding a natural measure on a space of 
coupling constants, which in our case was the space of CKM matrices, the double 
quotient f/(l)^\S'[/(3)/f/(l)^. We saw that the measure on this double quotient is 
nonunique, and we analysed several possible choices of measure on the double quo- 
tient. One class of measures was given by squashed Kaluza-Klein measures, induced 
by a Kaluza-Klein reduction of a left-invariant metric on the fiag manifold. Alterna- 
tively, one could take the unique measure on S'f/(3)/f/(l)^ and simply integrate over 
the left angles. The measure used by Ozsvath and Schiicking seemed not to be very 
well motivated from a geometric perspective. 

When calculating expectation values for J, we found that all of the measures we 
considered led to rather similar results. In each case, the observed value was about 
three orders of magnitude below what one would normally expect; the observed value 
appears to be finely tuned. The same applied to the Ozsvath-Schiicking measure, 
an extremely squashed Kaluza-Klein measure, or a fiat measure, which is just the 
simplest choice and not justified geometrically. 

In section [631 we adopted the different viewpoint that the CKM matrix should not 
be viewed as separate from the quark masses, but that it is really the mass matrices 
which are "chosen" by a yet unknown physical mechanism. We took the observed 
values for the quark masses as an input and chose the simplest distribution which 
was able to reproduce these observed values, while inducing a different measure on 
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the space of CKM matrices. While this is a choice we made, and all results depend 
on this choice, our measure is the combination of a maximally symmetric measure, 
invariant under a redefinition of a Hermitian matrix by conjugation by a unitary 
matrix, and a Gaussian incorporating the observed vahies of the quark masses. We 
would have to make additional rather strong assumptions to motivate a different 
choice of measure that would differ appreciably from this simple construction. It may 
well be that such assumptions are justified by the underlying mechanism determining 
the mass matrices, but we do not know of such a mechanism yet. Assuming such a 
distribution, together with the assumption that the mass matrices can be taken as 
Hermitian as is possible in the standard model, we found that the observed value of 
J now appears very natural and not finely tuned at all. In this statistical approach, 
regarding the Yukawa couplings determining the mass matrices as randomly chosen 
seems more appropriate than separating quark masses and mixing angles. 

In order to test the dependence of this nice result on the assumption of Hermitian 
mass matrices, we then repeated the calculation under the assumption that there is 
a left-right symmetry which implies that the quark mass matrices can not in general 
be taken to be Hermitian. We constructed the analogous probability distribution on 
the space of 3 x 3 complex matrices, again fitting four free parameters to expectation 
values for quark masses that can reproduce the observed values. We saw that using 
such a probability distribution the conclusion one reaches is rather different: One 
would now expect J to be about three orders of magnitude smaller than the observed 
value. Hence, there is a fine-tuning problem; without further assumptions, a fun- 
damental theory leading to a left-right symmetric electroweak sector at low energy 
should generically be expected to reproduce very weak CP violation. Invoking the 
principle of Occam's razor, "entia non sunt multiplicanda praeter neces- 
SITATEM," we would hke to conclude that, only looking at possible explanations for 
the magnitude of CP violation in the electroweak sector, the standard model should 
be preferred to left-right symmetric extensions such as Pati-Salam: In the latter one 
needs additional assumptions on the fundamental parameters that resolve the issue 
of observing "unnaturally large" CP violation, that are not necessary in the standard 
model, or any extension of it that allows a restriction to Hermitian mass matrices. 

In the general context of this thesis, these calculations provide an example of gauge 
symmetries, described by one or two copies of SU (2), whose presence or absence has a 
direct effect on measurable quantities, although we have followed a purely statistical 



Naturalness of CP Violation 



156 



approach rather than proposing dynamical mechanisms that could lead to certain 
values for these quantities. 

Although we have tried to argue that our results are independent of renormalisa- 
tion group flow since the relevant quantities do not run strongly with energy scale, 
there is another more subtle issue: The low-energy limit of a left-right symmetric ex- 
tension with non-Hermitian mass matrices would still be the standard model, where 
mass matrices can be assumed to be Hermitian, leading us back to the measure we 
considered previously. It would be desirable to incorporate this dependency of the 
assumptions one has used to construct the measure on energy scale into the analy- 
sis, namely to use a measure which depends on energy scale also. A starting point 
would be a quantification of "non-Hermiticity" that could then flow from zero at low 
energies to some non-zero value at high energies. At present these ideas are however 
somewhat vague, so that we will have to leave them to exploration in future work. 

We have argued that our analysis also applies to the case of massive neutrinos, 
where the predictions will conceivably be tested by future experiments. We have seen 
that for Majorana spinors, the Maki-Nakagawa-Sakata matrix |114] which appears is 
naturally an element of the single quotient f/(l)^\S'f/(3). Since the right phases do 
not play any role in neutrino oscillations and the relevant J is independent of these 
phases, the calculations are identical to the ones presented here, although with the 
appropriate values of the /i parameters appearing in A and A' . Based on our results, 
one would expect a strong hierarchy in neutrino masses to lead to suppressed CP 
violation. 

In the seesaw mechanism one adds very heavy right-handed neutrinos, and the 
most general mixing matrix (making the additional, at present experimentally unjus- 
tifled, assumption that the heavy neutrinos couple to charged leptons as well) would 
be an element of U{lY\SU{Q). This is naturally a Kahler manifold, and the measure 
induced by the Kahler metric can be obtained from the analysis in [136]. We leave a 
detailed treatment of this case, following our approach here, to future work. 

Finally, one could analyse the effects of a fourth generation of quarks on CP viola- 
tion by repeating the calculations for 4x4 Hermitian matrices. If this generalisation 
spoils the agreement with the observed J, one might obtain interesting lower bounds 
on the masses of a hypothetical extra generation of quarks. 
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Summary and Outlook 

"La recherche de la verite doit etre le hut de notre activite; 
c'est la seule fin qui soit digne d'elle." ^TJ^ 

Henri Poincare 

In this thesis we have investigated several examples within general relativity and 
particle physics where symmetries, and more precisely the geometry of symmetry 
groups and homogeneous spaces, played a fundamental role. In each of these cases, the 
study of geometric aspects of the underlying symmetries led to physical predictions, 
providing illuminating examples for the interplay between mathematical structures 
and the physical world that modern theoretical physics is based on. 

The first part consisting of chapters [2] to H] was concerned with deformed and 
constrained symmetries in relativity. 

In chapter [2], we studied exact solutions of general relativity in four dimensions 
from a group-theoretic viewpoint, focussing on their global symmetries (isometrics). 
The isometry groups of these spacetimes act simply-transitively; the spacetimes are 
homogeneous spaces G/H, where H was either trivial or one-dimensional. We applied 
the theory of deformations of Lie groups and algebras to these isometry groups, 
finding that some solutions are related by deformation of their isometry groups, and 
that these algebraic properties are partly related to physical properties; we have at 
least found indications that the Kaigorodov-Ozsvath solution contains closed timelike 
curves (CTCs) just as the Petrov solution which it is a deformation of. This work, 
which focussed on a few concrete relatively straightforward examples, leaves much 
scope for extension. First, one could go to higher dimensions, since in four dimensions 
all homogeneous solutions are more or less known. Second, one could investigate what 
the generic relationship is between algebraic properties (related to isometry groups) 
and physical properties (such as causal behaviour); one might be able to relate a 
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classification of group deformations to a classification of physical properties of certain 
classes of exact solutions. We leave this to further work. 

In chapter [31 we considered a formulation of general relativity as a gauge theory 
of the de Sitter group, shifting our focus to local symmetries. This formulation, an 
extension of the MacDowell-Mansouri formulation given by Stelle and West, replaces 
the perhaps more usual Poincare group by the de Sitter group, and provides another 
example of group deformation. While the viewpoint on general relativity as some kind 
of gauge theory is not new, we argued how Einstein-Cartan-Stelle-West theory may 
provide a "deformation" of general relativity analogous to the framework of deformed 
special relativity (DSR), and focussed on Minkowski space as an explicit example, 
thereby uncovering close relations between two apparently disconnected branches of 
the general relativity literature. We showed how the apparent "noncommutativity of 
spacetime" given by noncommuting translations on the internal ("momentum") de 
Sitter space is manifest in torsion of the connection. We gave estimates for the torsion 
one would get in the simplest situation, and discussed worrying physical consequences 
if electromagnetic fields couple to torsion, such as breaking of charge conservation. 
While this means that the framework we discussed is presumably not viable as a 
physical theory, it provided an interesting alternative interpretation of DSR. 

Our study was an application of Cartan geometry which is a generalisation of 
(pseudo-)Riemannian geometry in that the "tangent" space is replaced by an arbitrary 
homogeneous space G/H. It would be worth formulating other examples of such a 
situation in physics in this mathematical language. One example is the traditional 
formulation of loop quantum gravity (LQG) where the so(3, l)-valued connection is 
reduced to an so (3)- valued connection by partial gauge fixing. A recent study [3H] 
seems to generalise this gauge choice by essentially working in a Cartan geometric 
construction, but the precise relation is rather unclear to us. 

In chapter H] we studied yet another reformulation of general relativity, as a topo- 
logical BF theory with constraints. Instead of deforming a symmetry, we were con- 
straining it; the large symmetry on the phase space of the topological theory which 
implies the (local) equivalence of all solutions to the equations of motion could be 
constrained to be only the gauge symmetry of general relativity in four dimensions, 
where not all solutions are locally equivalent. Such a formulation, which is popular in 
covariant approaches to quantising gravity, has so far mainly been given in terms of 
quadratic constraints, which suffer from ambiguities in their solution whose relevance 
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in a quantum theory is not entirely clear. It is therefore an improvement if, as we 
have shown, an alternative formulation in terms of linear constraints can be found. 
This formulation seems closely related to discrete constraints on representations that 
are used in spin foam models, but requires in its corresponding classical formulation 
the introduction of a new "4D closure" constraint that has not been studied very 
much so far. An obvious extension of our work would be to study the effects of the 
presence of such a new constraint on spin foam models for quantum gravity. Linear 
constraints, whose solutions are formally given by projections, are also more closely 
related to the group field theory approach to quantum gravity, where constraints are 
usually enforced as projections |121] . 

The second part consisting of chapters O and El was a geometric study of gauge 
symmetries in the electroweak model and their role in CP violation. 

In chapter |5]we introduced a class of metrics on the group S'f/(3), the homogeneous 
space ^f/(3)/[/(l)2 and the double quotient space U{lf\SU{'i)/U{lY. While for the 
first two cases the SU{'i) left group action provided a clear criterion (left-invariance) 
for preferred classes of metrics, the situation for the double quotient was less clear 
and several possible choices were introduced. We also computed bi-invariant metrics 
(with respect to the natural group actions) on the space of Hermitian and on the 
space of complex 3x3 matrices. 

In chapter |6] we saw how CP violation in the electroweak sector is determined by 
the Jarlskog invariant J which can be viewed as a function on [/(l)^\S't/(3)/f/(l)^, 
the space of Cabibbo-Kobayashi-Maskawa (CKM) matrices. We tried to give a math- 
ematically precise answer to the question whether the observed magnitude of J, which 
appears to be rather small, should be considered natural or fine-tuned. Since finding 
this answer is equivalent to specifying a natural measure on the double quotient space, 
we analysed several possible choices of the measure induced by metrics presented in 
the previous chapter. While it seems not entirely clear which of these metrics should 
be preferred, we found that they all suggest fine-tuning in the magnitude of J. We 
then reformulated the problem by considering the space of quark mass matrices in- 
stead, using a measure which incorporates the observed quark masses and modifies 
the induced distribution on the space of CKM matrices. The effect of this modified 
distribution was the following: If the mass matrices can be taken to be Hermitian 
as in the standard model, we found that the input of observed quark masses is suf- 
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ficient to reproduce the observed magnitude oi CP violation to a remarkably high 
accuracy. If on the other hand one takes general complex mass matrices, one finds 
an "inverse" fine-tuning problem; the magnitude of J should be expected to be much 
smaller than is observed. This distinction is physically relevant since in models with 
extended left-right symmetry, such as the Pati-Salam model, one has to consider 
general complex mass matrices. Our results, which rely on assuming the simplest 
and geometrically best motivated probability distributions, seem therefore to suggest 
that the standard model should be preferred by Occam's razor: One does not need 
additional fine-tuning assumptions to explain the magnitude of CP violation. Hence, 
these calculations provide another example of how the modification of symmetries 
(the absence or presence of a left-right symmetry) leads to different physical (in this 
case statistical) predictions. 

Similar calculations could be performed for the case of four quark generations, 
where the mass of a fourth quark would have an effect on the probability distribu- 
tions we have considered. Another important extension of our results is to the case 
of neutrinos, where the formalism for CP violation is rather similar and the mathe- 
matics essentially the same. Our results would suggest that a large mass hierarchy 
in neutrinos (together with the known hierarchy in lepton masses) makes highly sup- 
pressed CP violation hkely, whereas on the other hand large CP violation might 
suggest that near-coincident neutrino masses appear more likely. Such predictions 
can obviously be tested in experiments such as those at LHC. 
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